(57) Abstract: The present invention relates to a method for the detection of the allelic state of n single nucleotide polymorphism
(SNP)-loci on genomic DNA comprising contacting said genomic DNA containing SNP-loci with n detector oligonucleotide sets 
(DO-sets). 
Ligation-based method of analysis of single nucleotide polymorphisms on genomic DNA.

The present invention relates to a method for the detection of the allelic state of n 
single nucleotide polymorphism (SNP)-loci on genomic DNA comprising the steps of 
(a) contacting said genomic DNA containing SNP-loci with n detector oligonucleotide
sets (DO-sets), wherein n represents the number of specific loci to be tested, and 
wherein each DO-set represents a specific collection of (aa) a first detector 
oligonucleotide (DO), wherein said first DO comprises, in 5' to 3' orientation, a 
Region I comprising a DNA sequence for PCR amplification; a Region II, or 
alternatively, II' for allele discrimination; and a Region III, for locus specific 
hybridisation, wherein said Region III comprises a DNA sequence which corresponds 
to the part of the sequence of the SNP locus located upstream of the polymorphic
single nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the first allelic state or, 
alternatively, for the second allelic state of said SNP-locus; and (ab) a second DO, 
wherein said second DO comprises, in 5' to 3' orientation, a Region I' corresponding 
in sequence to Region I of the first DO; a Region II', or alternatively, Region II for 
allele discrimination, wherein, when Region II is comprised in the first DO, the second 
DO must comprise Region II' or, alternatively, when Region II' is comprised in the 
first DO, the second DO must comprise Region II; and a Region III', wherein said
Region III' comprises a DNA sequence which corresponds to the part of the 
sequence of the SNP locus located upstream of the polymorphic single nucleotide
site and which consists, at its 3' terminus, of a single nucleotide complementary to 
the single nucleotide specific for either the second allelic state or, alternatively, for the 
first allelic state of said SNP-locus, wherein, when said Region III is specific for said
first allelic state, said Region III' is specific for said second allelic state or, 
alternatively, when said Region III is specific for said second allelic state, said Region 
III' is specific for said first allelic state of said SNP-locus; and (ac) a third DO, wherein
said third DO is 5'-phosphorylated and comprises, in 5' to 3' orientation, a Region IV
for locus specific hybridisation, wherein said Region IV comprises a DNA sequence 
which corresponds in sequence to the part of the sequence of the SNP locus located 
downstream of the polymorphic single nucleotide site and which, at its 5' terminus is 
located immediately downstream of the 3' terminus of the first or second DO; and a 
Region V comprising a DNA sequence for PCR amplification; and, optionally, a 
Region VI for locus discrimination, wherein said Region VI comprises a DNA 
sequence specific for each DO-set and which is linked at its 5' terminus to the 3' 
terminus of Region IV and at its 3' terminus to the 5' terminus of Region V; and 
wherein said Regions I/I' and II/II' are invariable in sequence in all said first and
second detector oligonucleotides in all DO-sets and said Region V is invariable in 
sequence in all said third detector oligonucleotides in all DO-sets, and wherein said 
Region VI is not identical in sequence in the different DO-sets; (b) carrying out a 
cyclic ligation reaction simultaneously with n DO-sets, wherein for each particular DO 
set of step (a) depending on the allelic state of the corresponding locus, either the 
first DO or the second DO or both DOs are linked to the third DO; (c) carrying out a 
first PCR-amplification reaction on the ligated products with a common first and a 
common second PCR primer, wherein said common first PCR primer corresponds in
sequence to Region I/I' of the first and second detector oligonucleotide of step (a)
and said common second PCR primer is complementary in sequence to Region V of 
the third DO of step (a); (d) separating the amplification products into n aliquots; (e) 
carrying out for each aliquot (ea) a second PCR-amplification reaction with the 
common first PCR primer of step (c) and a second PCR primer which is either 
complementary to Region IV or, optionally, to Region VI of the third DO of step (a), 
simultaneously or step-by-step with (eb) an allele discrimination detection reaction 
which detects Regions II and II' in the amplification products.

Furthermore, the invention relates to the detector oligonucleotides and to a kit
comprising the detector oligonucleotides of the invention.

In this specification, a number of documents are cited. The disclosure content of 
these documents including manufacturers' manuals is herewith incorporated by 
reference in its entirety. 

According to the formal definition, a Single Nucleotide Polymorphism (SNP)
represents a DNA sequence variant of a single base pair, with the minor allele
occurring in more than 1% of a given population. In principle, SNPs may be four-allelic,
because any nucleotide position may exist in four allelic variants (the number
of different bases in genomic DNA). However, the overwhelming majority of SNPs are
biallelic; therefore the general task of SNP detection is often formulated as
recognition of biallelic markers.

Compared to other genetic markers, SNPs cover genomes with a very high density. 
For example, in the human genome SNPs have been estimated to occur
approximately every thousand nucleotides (Li and Sadler 1991). In addition, SNPs
have a very low mutation rate per generation (10^-8) and thus may serve for studying
molecular evolution (Crow 1995; Li et al. 1996). SNP markers may be used for
person identification in criminalistics and for selection of individual therapy in
medicine and pharmaceutics. Besides, SNP genotyping is extremely valuable for the
identification of genes responsible for particular traits and disorders in medicine and
agriculture (Brookes 1999).

The broad field of applications of SNPs induced a great interest and a pressing need 
for effective instruments of SNP discovery (search for new SNPs) and detection 
(recognition of already known SNPs). Nowadays, the limiting factor for the use of SNP
genotyping in medicine, science and industry is the efficiency of the existing 
technologies (reliability, price and throughput). Existing approaches are adequate for 
analysis either of several loci in a large number of samples: TaqMan (Holland et al. 
1991; Livak et al. 1995), Invader (Lyamichev and Neri 2003), MALDI-based methods 
(Tost and Gut 2002), Amplifluor (Myakishev et al. 2001; Hawkins et al. 2002), DASH 
(Howell et al. 1999; Prince et al. 2001), SNaPshot (Applied Biosystems, PE
Corporation); or a large number of loci in few samples: microarray-based (Affymetrix,
Inc.; Wang et al. 1998) or padlock-based (Hardenbol et al. 2003) genotyping.
However, many applications, especially the association studies of polygenic traits,
require the scoring of hundreds of thousands of SNPs in thousands of individuals.
Such studies would cost in the range of 10^7 $ and are too expensive to be routine
now. Significant improvement of the available techniques or development of new
technologies is required not only to make the present research cheaper, but also to 
open previously inaccessible areas. The existing methods have the following 
disadvantages that restrict their application: 

(i) necessity to adjust conditions of individual SNP-genotyping reactions 
(TaqMan, Invader, DASH); 

(ii) requirement for the preliminary amplification of SNP-containing regions of
genomic DNA (MALDI-based methods, DASH, SNaPshot, microarray-
based methods);

(iii) high set-up price (TaqMan, microarray-based and padlock-based 
methods); 

(iv) complex experimental procedure which is difficult to automate (SNaPshot,
microarray-based and padlock-based methods); 

(v) requirement for very expensive equipment (MALDI-based methods). 

As discussed above, the methods of the prior art for analysis of single nucleotide 
polymorphisms manifest substantial disadvantages. Thus, there was a need for 
means that overcome said disadvantages. The technical problem underlying the 
present invention was therefore to provide such means.

The solution to said technical problem is achieved by providing the embodiments 
characterized in the claims. Accordingly, the present invention relates to a method for 
the detection of the allelic state of n single nucleotide polymorphism (SNP)-loci on 
genomic DNA comprising the steps of (a) contacting said genomic DNA containing 
SNP-loci with n detector oligonucleotide sets (DO-sets), wherein n represents the
number of specific loci to be tested, and wherein each DO-set represents a specific 
collection of (aa) a first detector oligonucleotide (DO), wherein said first DO 
comprises, in 5' to 3' orientation, a Region I comprising a DNA sequence for PCR
amplification; a Region II, or alternatively, II' for allele discrimination; and a Region III,
for locus specific hybridisation, wherein said Region III comprises a DNA sequence 
which corresponds to the part of the sequence of the SNP locus located upstream of 
the polymorphic single nucleotide site and which consists, at its 3' terminus, of a 
single nucleotide complementary to the single nucleotide specific for either the first 
allelic state or, alternatively, for the second allelic state of said SNP-locus; and (ab) a 
second DO, wherein said second DO comprises, in 5' to 3' orientation, a Region I'
corresponding in sequence to Region I of the first DO; a Region II', or alternatively,
Region II for allele discrimination, wherein, when Region II is comprised in the first
DO, the second DO must comprise Region II' or, alternatively, when Region II' is
comprised in the first DO, the second DO must comprise Region II; and a Region III', 
wherein said Region III' comprises a DNA sequence which corresponds to the part of 
the sequence of the SNP locus located upstream of the polymorphic single 
nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the second allelic state or,
alternatively, for the first allelic state of said SNP-locus, wherein, when said Region III 
is specific for said first allelic state, said Region III' is specific for said second allelic 
state or, alternatively, when said Region III is specific for said second allelic state, 
said Region III' is specific for said first allelic state of said SNP-locus; and (ac) a third 
DO, wherein said third DO is 5'-phosphorylated and comprises, in 5' to 3' orientation, 
a Region IV for locus specific hybridisation, wherein said Region IV comprises a DNA 
sequence which corresponds in sequence to the part of the sequence of the SNP 
locus located downstream of the polymorphic single nucleotide site and which, at its 
5' terminus is located immediately downstream of the 3' terminus of the first or 
second DO; and a Region V comprising a DNA sequence for PCR amplification; and,
optionally, a Region VI for locus discrimination, wherein said Region VI comprises a 
DNA sequence specific for each DO-set and which is linked at its 5' terminus to the 
3' terminus of Region IV and at its 3' terminus to the 5' terminus of Region V; and 
wherein said Regions I/I' and II/II' are invariable in sequence in all said first and
second detector oligonucleotides in all DO-sets and said Region V is invariable in 
sequence in all said third detector oligonucleotides in all DO-sets, and wherein said 
Region VI is not identical in sequence in the different DO-sets; (b) carrying out a
cyclic ligation reaction simultaneously with n DO-sets, wherein for each particular DO 
set of step (a) depending on the allelic state of the corresponding locus, either the 
first DO or the second DO or both DOs are linked to the third DO; (c) carrying out a 
first PCR-amplification reaction on the ligated products with a common first and a 
common second PCR primer, wherein said common first PCR primer corresponds in
sequence to Region I/I' of the first and second detector oligonucleotide of step (a)
and said common second PCR primer is complementary in sequence to Region V of 
the third DO of step (a); (d) separating the amplification products into n aliquots; (e) 
carrying out for each aliquot (ea) a second PCR-amplification reaction with the
common first PCR primer of step (c) and a second PCR primer which is either
complementary to Region IV or, optionally, to Region VI of the third DO of step (a), 
simultaneously or step-by-step with (eb) an allele discrimination detection reaction 
which detects Regions II and II' in the amplification products. 
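
The region layout of a DO-set and the outcome of the ligation step can be sketched as follows. This is only an illustrative model under stated assumptions: the class and function names are hypothetical, regions are represented by their labels rather than by real sequences, and Region III is arbitrarily taken to match the first allelic state and Region III' the second.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectorOligo:
    """One detector oligonucleotide, described by its ordered regions (5' to 3')."""
    regions: tuple


def build_do_set(with_region_vi=True):
    """Return the (first, second, third) DOs of one DO-set as region layouts."""
    first = DetectorOligo(("I", "II", "III"))
    second = DetectorOligo(("I'", "II'", "III'"))
    # Region VI is optional and sits between Region IV and Region V.
    right = ("IV", "VI", "V") if with_region_vi else ("IV", "V")
    third = DetectorOligo(right)
    return first, second, third


def ligation_products(do_set, alleles_present):
    """List the region layouts of the ligated products.

    alleles_present: subset of {"first", "second"}, the allelic states in the sample.
    Assumption of this sketch: Region III matches the first allelic state,
    Region III' the second.
    """
    first, second, third = do_set
    products = []
    if "first" in alleles_present:
        products.append(first.regions + third.regions)
    if "second" in alleles_present:
        products.append(second.regions + third.regions)
    return products
```

Only a ligated product carries both Region I (or I') and Region V, so only ligated products can be amplified with the two common primers; a heterozygous sample yields both products.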

The term "single nucleotide polymorphism (SNP)" is used in accordance with the 
present invention in a looser sense than in the formal definition: "SNP represents a
natural substitution of one DNA base to another, the least common allele occurring in 
more than 1% of a given population". In the present invention SNPs are referred to as 
being short allelic variants, i.e. substitutions or small insertions-deletions, wherein 
minimum allele frequencies for the polymorphisms are not taken into consideration. 
Such a loose definition is widely used. For example, the NCBI dbSNP database
(http://www.ncbi.nlm.nih.gov/SNP/) also uses said SNP definition, which does not
take into consideration allelic frequencies. 

The term "locus specific hybridisation" as used in connection with the present 
invention means specific hybridisation of the detector oligonucleotides with the 
definite SNP-containing region in the genomic DNA. 

The term "allelic state" used in accordance with the present invention means one of 
several existing variants of the nucleotide sequence of the particular DNA region. 

The term "allele discrimination" refers to the recognition of the allelic state of the 
particular SNP locus. 

In accordance with the present invention, the term "cyclic ligation reaction" refers to 
the ligation reaction performed with thermostable DNA ligase where the temperature 
of the reaction is periodically increased above the melting temperature of the DNA- 
substrate. 

The present invention relates to a high throughput method of SNP genotyping, 
particularly suitable for the analysis of a large number of SNPs (some hundreds) in a
large number of samples (thousands). The method of the invention is based on a
ligation detection reaction (LDR) performed directly on genomic DNA. During ligation
the biallelic state of the SNP locus is converted into a bimarker state of ligated 
detector oligonucleotide. The detector oligonucleotides of the present invention have 
a special design, which allows amplifying them and determining the state of the allelic 
marker. The procedure 

(i) uses a standard protocol for any SNP locus; 

(ii) is performed directly on the genomic DNA without preliminary PCR 
amplification; 

(iii) has an acceptable set-up price; 

(iv) includes a few simple pipetting steps;

(v) requires common laboratory equipment. 

The simple experimental procedure and the standard protocol for all SNP loci permit the
method of the invention to be performed on an automated liquid-handling system.

The present invention relates to a new ligation detection reaction (LDR)-based
method of SNP genotyping. The specific design of the detector oligonucleotides of
the invention permits the method of the invention to be carried out at high sensitivity,
which means that it can be performed directly on genomic DNA. The method of the 
invention comprises only a minimum number of steps and is the same for different 
SNP loci, i.e. no locus-specific optimization is required. Several SNP loci may be 
analysed simultaneously on the same sample of genomic DNA. 

The ligation detection reaction (LDR) is based on the faster ligation of matched 
compared to mismatched nicks (Figure 1). It is one of the most promising approaches 
for the analysis of SNP variants because it is more accurate than hybridisation 
methods and permits simultaneous analysis of several SNP loci in one reaction 
mixture. Some other LDR-based SNP genotyping methods which are sensitive 
enough to be performed on genomic DNA were described previously: Barany et al. 
2001; Faruqi et al. 2001; Lizardi et al. 1998; Hardenbol et al. 2003. Barany et al. (2001)
describe an LDR-PCR protocol for parallel analysis of several SNP loci on genomic
DNA. However, additional separation procedures such as electrophoresis or
microarray hybridisation are required for analysis of the LDR-PCR reaction. The method
of Faruqi et al. (2001) uses activated fluorescent labels (Amplifluor primers) and
permits detection of the result of the genotyping reaction in closed tubes. This method,
however, cannot be used for simultaneous analysis of several SNPs, and involves
expensive padlock probes. Hardenbol et al. (2003) managed to perform simultaneous 
padlock-based genotypings of about 1000 loci. This procedure, however, also 
exhibits substantial disadvantages. The use in Hardenbol et al. of padlock probes for 
ligation detection excludes the possibility of cyclic ligation - at least, without 
preliminary fragmentation of the genomic DNA - and thus restricts the sensitivity of 
the procedure. Furthermore, gap-filling reactions are performed in separate tubes
which, together with the use of several microarrays for one genotyping reaction,
decreases the reproducibility of the assay. Additionally, the detection procedure in
Hardenbol et al. is very complex, includes several enzymatic steps, and is 
problematic for automation. 

The general design of the detector oligonucleotides of the present invention is shown 
in Figure 2. Three DOs are used for each biallelic locus. These DOs have distinct 
regions for (i) amplification, (ii) allele and (iii) locus discrimination. Such design 
requires relatively long oligonucleotides, but, as will be shown below, DOs have a
distinct block structure and may be cost-efficiently produced by ligation-based 
synthesis (Borodina et al. 2003). 

The design of detector oligonucleotides with separate regions may have some 
variations for different detection schemes. Features in the DOs, which are conserved 
in any case, are: 

• Regions for amplification (I & V) which are constant for all DOs. They are 
located such that amplicons appear only as a result of ligation. If no ligation 
occurs, no amplicons are generated. 

• Two different allele-discrimination regions (II and II'), which are used for
biallelic loci. They are located in the same DOs as the allele-specific regions 
(III). If the locus is three- or four-allelic, then three or four allele-discrimination 
regions should be used. 

The scheme of the genotyping reaction is shown in Figure 3. In step (1) ligation 
detection reaction with n DO sets is performed on the same sample of genomic DNA. 
Depending on the allelic state of SNP-loci particular allele-discrimination Regions (II 
or II') are included in ligated pairs of detector oligonucleotides: Region II for locus_01
and Region II' for locus_n. Thus, the biallelic state of SNP loci is converted into the
bimarker state of ligated DOs. The first marker is Region II and the second marker is
Region II' (shown black and white in Figure 3).

The marker state of each particular SNP locus is determined in separate amplification 
reactions with one common and one locus-specific PCR primer (step 3). Locus-
specific primers (#PCR_01r, ... , #PCR_nr) are used to distinguish a single amplicon
from a complex mixture. An intermediate preamplification step 2 is introduced into the
protocol to prevent a decrease in sensitivity due to the division of the ligation mixture
into a number of separate aliquots ("n" in Figure 3). 
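
The three steps of the scheme can be traced with a small simulation. It is a sketch under simplifying assumptions, not the protocol itself: the function name and the dictionary-based sample description are hypothetical, ligation and amplification are treated as perfectly specific, and each ligated product is reduced to a (locus, marker) pair.

```python
def run_genotyping(sample, loci):
    """Trace the three-step scheme for one sample.

    sample: dict mapping locus name -> set of marker regions ("II", "II'")
            that the alleles present at that locus would give rise to.
    loci:   list of locus names covered by the n DO-sets.
    Returns a dict mapping locus name -> set of markers detected.
    """
    # Step 1: one multiplex ligation on the whole sample with all n DO-sets.
    ligated = [(locus, marker) for locus in loci for marker in sorted(sample[locus])]

    # Step 2: preamplification with the two common primers; every ligated
    # product is amplified, so the pool keeps the same composition.
    preamplified = list(ligated)

    # Step 3: n aliquots, each amplified with the common primer and one
    # locus-specific primer, which selects the amplicons of a single locus.
    calls = {}
    for locus in loci:
        calls[locus] = {marker for (name, marker) in preamplified if name == locus}
    return calls
```

For example, a sample that is homozygous at locus_01 and heterozygous at locus_02 returns one marker for the first locus and both markers for the second.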

The use of the specialized locus-discrimination Region VI in the design of the DOs of 
the invention is advantageous, because the same "n" locus-specific PCR-primers are 
used in different "n-loci" DO-sets. This makes the second amplification reaction more 
reliable and reproducible. The only locus-specific components are then locus-specific 
regions which participate in the ligation reaction. However (as demonstrated in 
Example 2 below) the locus-specific Region IV may be used instead of Region VI for 
locus discrimination. This substitution is advantageous for SNP-detection projects 
with a low number of SNP-loci, because exclusion of Region VI leads to shortening of
"right" DOs. 

As was shown in Amplified Fragment Length Polymorphism (AFLP)
experiments, the initial ratio of hundreds of amplicons is preserved when amplification
is performed with common primers (Vos et al. 1995). The design of the DOs of the
invention permits the use of two common primers for the amplification of all ligated DOs
in step 2 and thus keeps the initial ratio of different amplicons during PCR. Also,
in step 3, in each separate tube only two primers are used for amplification of
both allelic variants. 

When closed tube detection is performed in step 3 of Figure 3 (as in Example 2) the 
whole procedure involves only three pipetting steps: (i) addition of genomic DNA to
the Ligation Premixture; (ii) addition of the First PCR Mix to the ligation reaction; (iii)
aliquoting of the amplified material to the tubes with the Second PCR Mixtures.
These simple operations may be performed by an automated liquid-handling system.

The possibility of simultaneous analysis of a number of SNP loci on the same portion
of genomic DNA makes the method of the invention an adequate basis for clinical
assays where only a restricted amount of biological material is available for the
analysis. 



In a preferred embodiment of the method of the present invention, the first PCR 
amplification reaction of step 1(c) is performed simultaneously or step by step with 
the allele discrimination reaction of step 1(eb) and steps 1(d) and 1(ea) are omitted. 

With this preferred embodiment of the invention, it is possible to test only one single 
SNP locus. The scheme of the genotyping reaction for the detection of one single 
SNP locus is shown in Figure 4. This is a two-step reaction. First, depending on the 
allelic state of SNP-locus, a particular allele-discrimination Region (II or II') is included
in ligated pairs of DOs: Region II in case of Allele 1 and Region II' in case of Allele 2.
The marker status of the locus is then determined in the amplification reaction with 
common primers. 

If closed tube detection is used in step 2 of Figure 4 (as in Example 1), the method
of said preferred embodiment of the invention involves two pipetting steps: (i) addition
of genomic DNA to the Ligation Premixture; (ii) addition of universal PCR Mix to the
ligation reaction. Neither step (i) nor step (ii) operates with amplified DNA. Said
method is therefore safe in terms of cross-contamination, and can be applied for 
clinical and criminalistic applications. Furthermore, it may be advantageous for 
genotyping projects involving analysis of only a few loci in a large number of samples.

In another preferred embodiment of the method of the present invention a first 
oligonucleotide labelled with a first detectable label and able to hybridize with Region 
II and a second oligonucleotide labelled with a second detectable label and able to 
hybridize with Region II' are used in the allele discrimination reaction of step 1(eb).

In a more preferred embodiment of the method of the present invention said first and 
second detectable labels are two distinguishable fluorescent labels and their 
fluorescence is suppressed by quenchers located in close vicinity to said fluorescent
labels and the presence of Region II in the amplification products in the allele 
discrimination reaction of step 1 (eb) induces activation of the first fluorescent label 
but does not activate the second fluorescent label and the presence of Region II' in 
the amplification products in the allele discrimination reaction of step 1 (eb) induces 
activation of the second fluorescent label, but does not activate the first fluorescent 
label. 

This embodiment of the method of the present invention is carried out as a closed-
tube detection assay. The term "closed tube detection" refers to methods of
analysing a reaction without contacting the reaction solution with a probe and
without taking aliquots for examination. Commonly, such detection is performed by
optical methods such as, for example, measurement of absorption, fluorescence or
fluorescence polarization.
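
The read-out logic of such a two-label closed-tube assay can be sketched as a simple decision rule. The function name, the single shared threshold and the returned labels are assumptions made for illustration; the source does not specify how fluorescence signals are thresholded.

```python
def call_marker_state(signal_first, signal_second, threshold):
    """Translate two fluorescence readings into a marker state.

    signal_first:  reading of the first label (activated by Region II).
    signal_second: reading of the second label (activated by Region II').
    threshold:     hypothetical cut-off above which a label counts as activated.
    """
    first_on = signal_first > threshold
    second_on = signal_second > threshold
    if first_on and second_on:
        return "II + II'"   # both markers present: heterozygous locus
    if first_on:
        return "II"         # only the first marker present
    if second_on:
        return "II'"        # only the second marker present
    return "no call"        # neither label activated
```

In practice the cut-off would have to be established from controls for each instrument and label pair.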

In a further preferred embodiment of the method of the present invention both said 
quencher and said first fluorescent label are covalently attached to the first 
oligonucleotide and both said quencher and said second fluorescent label are 
covalently attached to the second oligonucleotide and spatial separation is
performed by Taq polymerase 5'-exonuclease digestion of the first oligonucleotide
hybridized to Region II or the second oligonucleotide hybridized to Region II'.

This preferred embodiment of the present invention makes use of the method of 
closed-tube detection with fluorescent labels and 5' exonuclease assay (TaqMan:
Holland et al. 1991; Livak et al. 1995).

In another preferred embodiment of the method of the present invention both said 
quencher and said first fluorescent label are covalently attached to the first 
oligonucleotide and both said quencher and said second fluorescent label are 
covalently attached to the second oligonucleotide and each of the first and second 
oligonucleotide contains two short complementary regions and said first or second 
fluorescent label and said quencher are brought together when said complementary 
regions are hybridised with each other, and the separation of said first or second 
fluorescent label from the quencher is due to change of spatial conformation of the
first or second labelled oligonucleotide after hybridization with Region II or II'.

This preferred embodiment of the present invention makes use of the method of 
closed-tube detection with fluorescent labels and Molecular Beacons (Tyagi et al.
1998). 

In yet another preferred embodiment of the method of the present invention said first 
and second oligonucleotides contain covalently attached fluorescent labels and said 
quenchers are attached to additional oligonucleotides and said additional 
oligonucleotides are complementary to said first and second oligonucleotides and 
said first or second fluorescent label and said quencher are brought together when 
said additional oligonucleotides are hybridised with said first and second 
oligonucleotides and said quencher is separated from said fluorescent label when 
said first and second oligonucleotides hybridize with Regions II or II'. 

This preferred embodiment of the present invention makes use of the method of
closed-tube detection with fluorescent labels and composite FRET primers (Li et al.
2002).

The above mentioned embodiments of the method of the present invention relate to 
different techniques of identification of the marker state and are all based on the 
Fluorescence Resonance Energy Transfer (FRET) principle. It is also possible to base
closed-tube detection on other principles, for example on fluorescence polarisation 
detection (Chen et al. 1999). 

In a preferred embodiment of the method of the present invention the allele 
discrimination reaction of step 1 (eb) is performed with a microarray detection system 
simultaneously or step by step with the first amplification reaction of step 1(c), 
wherein individual elements of said microarray are able to hybridize with locus- 
specific Regions IV or, optionally, Regions VI of amplification products and Regions II 
and II' in the amplification products are detected by hybridization with the first and
second oligonucleotides labeled with two distinguishable fluorescent labels.

This embodiment of the method of the present invention includes two basic steps: (i)
the conversion of the state of the biallelic SNP loci into the state of universal
markers (Region II and II') and (ii) the identification of the marker state. These steps
are independent from each other and may be optimized or redesigned separately. As 
already outlined above in other terms, the design of detector oligonucleotides may 
have some variations for different techniques of identification of the marker state. An 
example of microarray-based detection procedure with locus-discriminating Region 
VI located in the "left" DO is shown in Figure 5. The first two steps (multilocus 
ligation and PCR amplification with common primers) are the same as in Figure 3. 
Optional digestion by restriction endonuclease in step 3 removes any locus-specific 
sequences from amplicons. This prevents locus-specific variation of hybridisation 
velocity. Single-stranded hybridisation probes are prepared in this step by purification 
on streptavidin paramagnetic particles. Step 4 is hybridisation of single-stranded
probes with microarray and with two (for biallelic loci) fluorescently labelled (Cy3 and 
Cy5 in Figure 5) allele-discrimination oligonucleotides complementary to Regions II 
and II', corresponding to the first and the second oligonucleotides of the present 
invention labelled with detectable labels. Each spot of the microarray contains DNA 
fragments hybridising with particular locus-discrimination Region VI. Locus-
discrimination sequences are the same for different n-loci sets, so the same universal
microarray is used for any genotyping. Simultaneous hybridisation of single-stranded
amplicons with microarray and allele-discrimination oligonucleotides (sandwich-
hybridisation) allows both loci and allelic variants to be distinguished in a single
hybridisation reaction. Loci are distinguished according to position on microarray. 
Allelic states of loci are distinguished according to fluorescent signals in 
corresponding spots (Cy3 and/or Cy5). 
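
The way a universal microarray separates loci (by spot position) from allelic states (by colour) can be illustrated with the following sketch. The function name, the dictionary layout and the threshold are hypothetical, and the assignment of Cy3 to Region II and Cy5 to Region II' is an assumption; the text does not state which dye reports which region.

```python
def read_microarray(spot_to_locus, cy3, cy5, threshold):
    """Decode a sandwich-hybridisation read-out.

    spot_to_locus: dict mapping spot position -> locus name (fixed by Region VI).
    cy3, cy5:      dicts mapping spot position -> measured intensity.
    threshold:     hypothetical cut-off for calling a signal present.
    Assumption: Cy3 reports Region II and Cy5 reports Region II'.
    """
    calls = {}
    for spot, locus in spot_to_locus.items():
        markers = set()
        if cy3.get(spot, 0.0) > threshold:
            markers.add("II")
        if cy5.get(spot, 0.0) > threshold:
            markers.add("II'")
        calls[locus] = markers
    return calls
```

Because the spot-to-locus map depends only on Region VI, the same map can be reused for any DO-set collection that uses the same locus-discrimination sequences.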

In another preferred embodiment of the method of the present invention said 
common second PCR primer is destroyed after the first PCR-amplification of step
1(c).

In a more preferred embodiment of the method of the present invention said common 
second PCR primer contains Uridine- or Ribo-nucleotides and is destroyed by Uracil
DNA Glycosylase (UDG) or RNase included in said second PCR-amplification 
reaction of step 1 (e). 

The marker state of each particular SNP locus is determined in separate amplification 
reactions with one common and one locus-specific PCR primer (step 3 in Figure 3). 
Locus-specific primers (#PCR_01r, ... , #PCR_nr) are used to distinguish a single
amplicon from a complex mixture. However, residual common PCR primers from
the First PCR amplification (step 2 in Figure 3) may decrease the specificity of the 
reaction. Uridine- or Ribo-nucleotides may be introduced into the sequence of 
interfering primer to have a possibility to destroy it before the Second PCR reaction. 
The use of Uracile DNA Glycosylase (UDG) or RNase is convenient, because these 
enzymes may be included into the Second PCR Mixtures without modifying the 
experimental procedure. 

In a further preferred embodiment of the method of the present invention said 
Regions I and I' have identical sequences. 

In an additionally preferred embodiment of the method of the present invention said 
first, second and third DOs are prepared by ligation based synthesis. 

Though left and right detector oligonucleotides may be prepared by conventional 
phosphoramidite technology, their block structure makes them appropriate for 
ligation-based synthesis (LBS) (Borodina et al. 2003). In LBS the constant parts of 
oligonucleotides have to be synthesized only once. "Left" DOs are combinations of 
allele-specific Regions III with two (for biallelic SNPs) conserved presynthesized 
oligos (Regions I & II or I & II'). "Right" detector oligonucleotides are combinations of 
locus-specific parts (Region IV) with n conserved presynthesized oligos (Regions VI 
& V). For example, LBS of 100 sets of DOs (1.5nmol-scale synthesis of three DOs 
per locus) costs about $5000, i.e. $50 per DO-set. The price is therefore 
about six-fold lower than with conventional phosphoramidite technology. Due to the 
low working concentration (1nM), a single 1.5nmol-scale synthesis provides enough 
DOs for about 300000 genotyping reactions. 
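
The cost and yield figures quoted above can be checked with a short calculation. The sketch below is only illustrative: the variable and function names are assumptions introduced here, and the 5µl reaction volume is taken from the ligation reactions described in the Examples.

```python
# Figures quoted in the text; all names are illustrative assumptions.
SYNTHESIS_SCALE_FMOL = 1_500_000   # 1.5 nmol expressed in fmol
WORKING_CONC_NM = 1                # 1 nM working concentration of each DO
REACTION_VOLUME_UL = 5             # 5 µl ligation reaction (from the Examples)
TOTAL_COST_USD = 5000              # quoted cost of LBS for 100 DO-sets
N_SETS = 100


def fmol_per_reaction(conc_nm: int, volume_ul: int) -> int:
    """1 nM equals 1 fmol per µl, so the amount in fmol is nM x µl."""
    return conc_nm * volume_ul


def reactions_per_synthesis(scale_fmol: int, per_reaction_fmol: int) -> int:
    """Number of genotyping reactions one synthesis can supply."""
    return scale_fmol // per_reaction_fmol


per_reaction = fmol_per_reaction(WORKING_CONC_NM, REACTION_VOLUME_UL)
n_reactions = reactions_per_synthesis(SYNTHESIS_SCALE_FMOL, per_reaction)
cost_per_set = TOTAL_COST_USD / N_SETS

print(per_reaction, n_reactions, cost_per_set)
```

With these inputs the calculation gives 5 fmol of each DO per reaction, 300000 reactions per synthesis and $50 per DO-set, in agreement with the figures in the text.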

An additional advantage of ligation-based synthesis is the possibility to include, at low 
cost, modified bases in the common part of composite oligonucleotides, for example 
exonuclease-resistant groups. Such modifications may be used to remove unligated 
DOs by exonuclease treatment after the ligation detection reaction. If exonuclease-resistant 
regions are included in the 5'-terminal part of the "left" DO and in the 3'-terminal 
part of the "right" DO, then unligated DOs have one exonuclease-sensitive 
end, but ligated DOs are protected at both ends. An example of an 
exonuclease-resistant group is a phosphorothioate bond. When included at the 3'-end, such groups 
protect oligonucleotides from digestion by Exonuclease I, Exonuclease III (Lutz et al. 
2001), and the 3'->5' exonuclease activity of proofreading DNA polymerases (T4 DNA 
polymerase, Pfu polymerase) (Dean et al. 2001). When included at the 5'-end, they 
protect oligonucleotides from T7 Gene 6 Exonuclease (Erdogan et al. 2001) and 
λ-exonuclease. 
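
The protection logic described above can be expressed as a small model. This is a hedged sketch and not part of the described method: the class and function names are hypothetical, and an oligonucleotide is simply treated as degraded whenever an exonuclease of the matching polarity finds an unprotected end.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    name: str
    protected_5: bool  # exonuclease-resistant groups at the 5' end
    protected_3: bool  # exonuclease-resistant groups at the 3' end


def ligate(left: Oligo, right: Oligo) -> Oligo:
    """The ligated product keeps the 5' end of the left DO and the 3' end of the right DO."""
    return Oligo(f"{left.name}+{right.name}", left.protected_5, right.protected_3)


def survives(oligo: Oligo, exo_3_to_5: bool = True, exo_5_to_3: bool = True) -> bool:
    """Simplified rule: an unprotected end exposed to a matching exonuclease means degradation."""
    if exo_3_to_5 and not oligo.protected_3:
        return False
    if exo_5_to_3 and not oligo.protected_5:
        return False
    return True


left_do = Oligo("left", protected_5=True, protected_3=False)
right_do = Oligo("right", protected_5=False, protected_3=True)
product = ligate(left_do, right_do)

for oligo in (left_do, right_do, product):
    print(oligo.name, survives(oligo))
```

In this model only the ligated product survives a treatment that combines 3'->5' and 5'->3' exonuclease activities, which is the behaviour the paragraph describes.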

In another preferred embodiment of the method of the present invention the cyclic 
ligation reaction of step 1(b) is effected with a thermostable DNA ligase, preferably Taq 
DNA ligase or Tth DNA ligase, and most preferably Pfu DNA ligase. 

Comparisons with other commercially available thermostable ligases such as Taq 
DNA ligase, Tth DNA ligase, Ampligase and Ligase-65 showed that the Pfu DNA 
ligase manifests higher fidelity. 


In still another preferred embodiment of the method of the present invention the 
concentration of DOs in each DO set is in the range of 10pM to 10nM, preferably 
1nM. 

In a further preferred embodiment of the method of the present invention at least one 
cycle of ligation reaction is performed in step 1(b), preferably more than 10 cycles, 
and most preferably 15 to 25 cycles. 

In still a further preferred embodiment of the method of the present invention the 
cyclic ligation of step 1(b) is effected in the presence of at least 1% 
polyethyleneglycol (PEG), preferably in the presence of at least 5% PEG and most 
preferably in the presence of about 15% PEG. 

In a more preferred embodiment of the method of the present invention said 
polyethylene glycol has a molecular weight of more than 200, preferably more than 
3000 and most preferably in the range 6000 - 8000. 

Concentration of DOs is one of the main parameters in the method of the invention. 
Too low an amount of DOs leads to suppression of the signal, whereas too high 
an amount results in increased background. An intermediate concentration of DOs, 
sufficient for quantitative detection but not high enough to induce background, is 
preferred. The inventors discovered that 0.1 - 1 nM concentrations of detector 
oligonucleotides are enough to provide quantitative hybridization with the target in a 
short time (20 sec during cyclic ligation) in the presence of 15% PEG 6000. About 50 
times less DO is required (2 - 20 pM) if a long hybridization (for some hours) 
precedes the ligation. The concentration of DOs should be about ten times higher if 
hybridization is performed without PEG. The inventors could show that PEG of 
different molecular weights (from 200 to 12000) stimulates the ligation reaction 
(though PEG with Mw of more than 3000 works better than PEG 200 - 600). 
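
The concentration guidance in the preceding paragraph can be summarised as a small lookup. The following is a minimal sketch under stated assumptions: the function name and its parameters are hypothetical, the ranges are the ones quoted in the text for 15% PEG 6000, and the "about ten times" factor for reactions without PEG is applied as an exact factor of 10.

```python
def suggested_do_range_pm(long_hybridisation: bool, with_peg: bool) -> tuple[float, float]:
    """Return an approximate (low, high) DO concentration range in pM.

    Short hybridisation (20 sec per ligation cycle) with PEG: 0.1 - 1 nM.
    Long hybridisation (some hours) before ligation with PEG: 2 - 20 pM.
    Without PEG the text suggests roughly ten times more DOs.
    """
    low, high = (2.0, 20.0) if long_hybridisation else (100.0, 1000.0)
    factor = 1.0 if with_peg else 10.0
    return (low * factor, high * factor)


for long_hyb in (False, True):
    for peg in (True, False):
        print(long_hyb, peg, suggested_do_range_pm(long_hyb, peg))
```

The values are orders of magnitude rather than sharp limits; as noted in the following paragraph, individual DOs may require different minimal concentrations.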

Different DOs may have different minimal concentrations because of particular 
secondary structure and sequence-specific interactions with non-target genome 
sequences. High enough concentrations are preferred for high-throughput projects to 
exclude time-consuming individual optimization. Examples 1 and 4 show that 5 fmol 
of detector oligonucleotides in a 5µl ligation reaction (1nM concentration) give 
practically the same results for different loci. 

In another preferred embodiment of the method of the present invention the cyclic 
ligation of step 1(b) is effected with at least 2 subcycles, preferably with 3 to 6 
subcycles. 

Each cycle of the ligation reaction may include subcycles of denaturation of unligated 
DOs (65°C for 30 sec, 74°C for 10 sec in the Examples). When the temperature 
increases to 74°C, the majority of mismatched DOs dissociate from the template and the free 
templates may be occupied by matched DOs. Ligated DOs remain hybridized 
because they are longer and have a higher melting temperature. The subcycles therefore (i) 
raise the chance of matched DOs occupying the template and being ligated, 
and thus increase the yield of the reaction about two-fold; and (ii) decrease the time 
during which mismatched DOs may be ligated, so that misligation is decreased. 

The invention also relates to a detector oligonucleotide (DO) selected from the group 
consisting of (a) a first DO which comprises, in 5' to 3' orientation, a Region I 
comprising a DNA sequence for PCR amplification; a Region II, or alternatively, II' for 
allele discrimination and a Region III, for locus specific hybridization, wherein said 
Region III comprises a DNA sequence which corresponds to the part of the sequence 
of the SNP locus located upstream of the polymorphic single nucleotide site and 
which consists, at its 3' terminus, of a single nucleotide complementary to the single 
nucleotide specific for either the first allelic state or, alternatively, for the second 
allelic state of said SNP-locus; and (b) a second DO, which comprises, in 5' to 3' 
orientation, a Region I' corresponding in sequence to Region I of the first DO; a 
Region II', or alternatively, Region II for allele discrimination, wherein, when Region II 
is comprised in the first DO, the second DO must comprise Region II' or, 
alternatively, when Region II' is comprised in the first DO, the second DO must 
comprise Region II; and a Region III', wherein said Region III' comprises a DNA 
sequence which corresponds to the part of the sequence of the SNP locus located 
upstream of the polymorphic single nucleotide site and which consists, at its 3' 
terminus, of a single nucleotide complementary to the single nucleotide specific for 
either the second allelic state or, alternatively, for the first allelic state of said SNP- 
locus, wherein, when said Region III is specific for said first allelic state, said Region 
III' is specific for said second allelic state or, alternatively, when said Region III is 
specific for said second allelic state, said Region III' is specific for said first allelic 
state of said SNP-locus; and (c) a third DO, which is 5'-phosphorylated and 
comprises, in 5' to 3' orientation, a Region IV for locus specific hybridization, wherein 
said Region IV comprises a DNA sequence which corresponds in sequence to the 
part of the sequence of the SNP locus located downstream of the polymorphic single 
nucleotide site and which, at its 5' terminus is located immediately downstream of the 
3' terminus of the first or second DO; and a Region V comprising a DNA sequence 
for PCR amplification; and, optionally, a Region VI for locus discrimination, wherein 
said Region VI comprises a DNA sequence specific for each DO-set and which is 
linked at its 5' terminus to the 3' terminus of Region IV and at its 3' terminus to the 5' 
terminus of Region V; and wherein said Regions I/I' and II/II' are invariable in 
sequence in all said first and second detector oligonucleotides in all DO-sets and said 
Region V is invariable in sequence in all said third detector oligonucleotides in all 
DO-sets, and wherein said Region VI is not identical in sequence in the different DO-sets. 

In a preferred embodiment of the invention, said detector oligonucleotide comprises a 
nucleotide sequence as set forth in SEQ ID NOs 1 to 129. 

Furthermore, the invention relates to the use of the detector oligonucleotide of the 
invention in the allele discrimination of biallelic blocks of closely located nucleotide 
substitutions or small insertions/deletions, wherein Region III comprises a DNA 
sequence which corresponds to the first allelic state of said block and Region III' 
comprises a DNA sequence which corresponds to the second allelic state of said 
block and wherein Regions III and III' have different 3' terminal nucleotides. 

Finally, the invention relates to a kit comprising one or several detector 
oligonucleotide sets, wherein each set comprises a first, a second and a third 
detector oligonucleotide (DO), wherein said first DO comprises, in 5' to 3' orientation, 
a Region I comprising a DNA sequence for PCR amplification; a Region II, or 
alternatively, II' for allele discrimination and a Region III, for locus specific 
hybridization, wherein said Region III comprises a DNA sequence which corresponds 
to the part of the sequence of the SNP locus located upstream of the polymorphic 
single nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the first allelic state or, 
alternatively, for the second allelic state of said SNP-locus; and wherein said second 
DO comprises, in 5' to 3' orientation, a Region I' corresponding in sequence to 
Region I of the first DO; a Region II', or alternatively, Region II for allele 
discrimination, wherein, when Region II is comprised in the first DO, the second DO 
must comprise Region II' or, alternatively, when Region II' is comprised in the first 
DO, the second DO must comprise Region II; and a Region III', wherein said Region 
III' comprises a DNA sequence which corresponds to the part of the sequence of the 
SNP locus located upstream of the polymorphic single nucleotide site and which 
consists, at its 3' terminus, of a single nucleotide complementary to the single 
nucleotide specific for either the second allelic state or, alternatively, for the first 
allelic state of said SNP-locus, wherein, when said Region III is specific for said first 
allelic state, said Region III' is specific for said second allelic state or, alternatively, 
when said Region III is specific for said second allelic state, said Region III' is 
specific for said first allelic state of said SNP-locus; and wherein said third DO is 5'- 
phosphorylated and comprises, in 5' to 3' orientation, a Region IV for locus specific 
hybridization, wherein said Region IV comprises a DNA sequence which corresponds 
in sequence to the part of the sequence of the SNP locus located downstream of the 
polymorphic single nucleotide site and which, at its 5' terminus is located immediately 
downstream of the 3' terminus of the first or second DO; and a Region V comprising 
a DNA sequence for PCR amplification; and, optionally, a Region VI for locus 
discrimination, wherein said Region VI comprises a DNA sequence specific for each 
DO-set and which is linked at its 5' terminus to the 3' terminus of Region IV and at its 
3' terminus to the 5' terminus of Region V; and wherein said Regions I/I' and II/II' are 
invariable in sequence in all said first and second detector oligonucleotides in all DO- 
sets and said Region V is invariable in sequence in all said third detector 
oligonucleotides in all DO-sets, and wherein said Region VI is not identical in 
sequence in the different DO-sets. 



The figures show: 

Figure 1. The maximum discrimination of ligase-detection reaction is achieved, if 
SNP position corresponds to the extreme 3'-end of "left" DOs (here #1 and #2 are 
"left" DOs, #3 is "right" DO). DOs #1 and #3 are preferentially ligated in the presence 
of allele 1 (T), DOs #2 and #3 - in the presence of allele 2 (G). 

Figure 2. General design of detector oligonucleotides. Regions I and V (marked 
streaky) are used for PCR amplification. They are the same for all loci. Two common 
PCR primers (#PCR_L, #PCR_R) are shown in the lower part of the figure. Region II 
is the allele-discrimination part. Two types of sequences (marked black and white) 
are enough for biallelic loci. Region VI is the locus-discrimination part; n types of 
sequences ("type_01", "type_02", ... , "type_n") are necessary to discriminate n 
simultaneously ligated loci. Regions III and IV are locus specific sequences 
responsible for targeting DOs to the SNP positions. 

Figure 3. Overview of the SNP detection protocol. Step 1: multiplex ligation. Step 2: 
PCR preamplification with the common primers. Step 3: locus-specific amplification 
with one specific PCR primer. 

Figure 4. Overview of the "single locus" SNP detection protocol. Step 1: depending 
on the allelic state of the locus a particular "left" DO is ligated to the "right" DO. Step 
2: PCR amplification. 

Figure 5. Overview of microarray-based detection procedure with locus- 
discriminating Region VI located in "left" DOs. 

Figure 6. Overview of "single locus" SNP detection protocol of Example 1. Step 1: 
depending on the allelic state of the locus particular allele-discrimination region is 
included in ligated pair of detector oligonucleotides. Step 2: PCR amplification with 
TaqMan detection. Sequences of TaqMan probes coincide with the sequences of 
allele-discrimination regions. 

Figure 7. Overview of the SNP-detection protocol of Example 2. Step 1: depending 
on the allelic state of the locus particular allele-discrimination region is included in 
ligated pair of detector oligonucleotides. Step 2: PCR preamplification with common 
PCR primers. Step 3: TaqMan detection with universal detector oligonucleotides 
reveals the allelic state of each locus. Amplification is performed in n individual tubes. 
Only a particular locus is amplified if one PCR primer is locus-specific (#PCR_01r, 
#PCR_02r, ... #PCR_nr). 

Figure 8. DOs and PCR primers for locus "01" aligned relative to the genomic 
sequence. A. Genomic sequences in the vicinity of SNP position for accessions Col-0 
and C24. B. Roman numerals indicate structure regions of DOs. C. Oligonucleotides 
used for genotyping. Locus-specific oligonucleotides used for preparation of DOs 
(#L1_01, #L2_01, #R_01) are marked by a gray background. #L1, #L2 and #R are 
common primers. #aL1, #aL2 and #aR are adaptors used for ligation-based synthesis 
of DOs. #fam and #tet are universal TaqMan probes. #PCR_L and #PCR_R are 
common PCR primers. #PCR_01r is a locus-discrimination PCR primer used in 
Example 2. 

Figure 9. Scatter plots of "single locus" genotyping data shown in Tables 3 and 4. 
Fam and Tet signals are plotted on the X and Y axes respectively. Homozygous 
samples are represented as circles (C24) and squares (Col-0), heterozygous 
(C24/Col-0) - as triangles, no DNA controls - as rhombs. A. Fifteen independent 
genotyping reactions were performed for loci "18" and "20" on 5-150ng of genomic 
DNA. B. Genotyping of 30 different loci was performed on 50ng of homozygous and 
heterozygous genomic DNA. Arrows show signals for loci "18" and "20". 

Figure 10. Scatter plots of genotyping data shown in Tables 5 and 6. Fam and Tet 
signals are plotted on the X and Y axes respectively. Homozygous samples are 
represented as circles (C24) and squares (Col-0), heterozygous (C24/Col-0) - as 
triangles, no ligation controls (amplification with common PCR primers) - as rhombs. 
A. Fifteen independent genotyping reactions were performed for loci "10", "18", "20", 
and "29" on 5ng (filled icons), 17ng, 50ng and 150ng of genomic DNA. B. Genotyping 
of 30 different loci was performed on 50ng of homozygous and heterozygous 
genomic DNA. Arrows show signals for loci "10", "18", "20", and "29". Signals for 
locus "15" are not shown on the plot. 


Figure 11. Influence of DO concentration (13pM, 40pM, 130pM, 400pM, and 
1300pM) on the fluorescent signal. Genotyping was performed on 50ng of Ler DNA. 
Tet and Fam signals are shown as circles and squares respectively. 

Figure 12. Practically all templates are used in each ligation cycle. 

Figure 13. 2 hours hybridization followed by short ligation. Amplification profiles 
demonstrate, that ligation is quantitative for DOs concentration >20pM without PEG 
(A) or >2pM in presence of 15% PEG 6000 (B). 

MATERIALS AND METHODS 

30 Arabidopsis thaliana SNP loci were used in the work. Two loci ("25" and "26") 
polymorphic for accessions Columbia (Col-0) and Landsberg (Ler) are from (Cho et al. 1999). The 
other 28 loci (for accessions Col-0 and C24) were selected from the MASC-SNP 
database: http://www.mpiz-koeln.mpg.de/masc/ (Schmid et al., 2003). 24 SNPs are 
spread evenly along the 1st chromosome of A. thaliana and 6 are distributed 
randomly on the 4th chromosome. Genomic sequences adjacent to SNP positions 
(30 nt on both sides) are shown in Table 1. 

The list of common oligonucleotides and 30 sets of locus-specific oligonucleotides is 
given in Table 2. The fluorescent probes (with FAM or TET on 5'-ends and TAMRA 
on 3'-ends) for TaqMan assay were prepared by MWG Biotech (Germany). Other 
oligonucleotides were synthesized by TIB Molbiol (Germany). 

T4 DNA ligase, Taq DNA ligase and T4 PNK were from New England BioLabs 
(USA). Pfu DNA ligase was from Stratagene (USA), Tth DNA ligase - from ABgene 
(UK), Ampligase - from Epicentre Technologies (USA) and Ligase-65 - from MRC- 
Holland. Reagents for real-time PCR (TaqMan PCR Core reagents kit, #N808-0228), 
optical plates and covers were from Applied Biosystems (USA). 

Preparation of detector oligonucleotides (DOs). 

Locus-specific oligonucleotides corresponding to Regions III and IV of DOs were 
selected so that their melting temperatures were close to 55°C and 60°C respectively 
(estimated by the Vector NTI program (InforMax, USA) for 4000pM primers and 
50mM salt). We tried to exclude sequences showing strong homology to other 
regions of the A. thaliana genome or containing tandem repeats and long single base 
stretches. 
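
One of the selection criteria above, the exclusion of long single base stretches, is easy to automate. The following is a minimal sketch only: the function names and the threshold of four bases are assumptions made for illustration (the text gives no numerical cut-off), and melting temperature and genome-wide homology checks are not covered.

```python
from itertools import groupby


def longest_single_base_stretch(seq: str) -> int:
    """Length of the longest run of one repeated base in seq."""
    return max((len(list(run)) for _, run in groupby(seq.upper())), default=0)


def passes_stretch_filter(seq: str, max_stretch: int = 4) -> bool:
    """Hypothetical filter: reject sequences with runs longer than max_stretch."""
    return longest_single_base_stretch(seq) <= max_stretch


# Upstream flank of locus MASC02113 as listed in Table 1.
flank = "TTTTCGAAAATTACACCAGATTCACCAAAC"
print(longest_single_base_stretch(flank), passes_stretch_filter(flank))
```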

For ligation-based synthesis of DOs (Borodina et al. 2003) the "left" locus-specific 
oligonucleotides (#L1_01, ... , #L1_30, and #L2_01, ... , #L2_30) were 
phosphorylated: 1 nmol of each oligonucleotide was incubated at 37°C for 1 hour in 
10µl of 1x T4 PNK buffer (Tris-HCl pH 7.6, 70mM; MgCl2, 10mM; dithiothreitol, 5mM) 
with 1mM ATP and 2.5u of T4 PNK. The PNK was then heat inactivated at 65°C for 
20 minutes. Phosphorylated primers were used in LBS without any purification. 
Oligonucleotide #R, which is common for all loci, was phosphorylated during the 
phosphoramidite synthesis. 

1nmol-scale ligation reactions (involving 1 nmol of each of three oligonucleotides: 
common, locus-specific and adapter) were performed for 1 hour at 20°C in 30µl of 
1x T4 ligase buffer (Tris-HCl pH 7.5, 50mM; MgCl2, 10mM; dithiothreitol, 10mM; ATP, 
1mM; BSA, 25 µg/ml), with PEG 6000 (15%) and 250u of T4 DNA ligase. T4 DNA 
ligase was then heat inactivated at 65°C for 15 minutes. The "right" DOs were 
phosphorylated for 30 min at 37°C directly in the ligation mixture by adding 2.5u of T4 
PNK. 

DOs were purified by denaturing PAGE. Corresponding bands 
were visualized by UV shadowing on printer paper (or on the DC Alufolien Kieselgel 
60F254 chromatographic plate (Merck, Germany), which gives slightly higher sensitivity) 
and were cut out. DNA was eluted in 150µl of 2M LiClO4 for 1 hour at 60°C and 
precipitated with acetone as described in (Daniliuk et al. 1986). 

Detection of a single SNP locus. 


The overview of the procedure is shown in Figure 6. Ligation detection was 
performed on different amounts of homozygous or heterozygous Arabidopsis thaliana 
genomic DNA in the presence of 5fmol of each DO (one locus set per tube) in 5µl of 
1x Pfu ligase buffer (Tris-HCl, pH 7.5, 20mM; KCl, 20mM; MgCl2, 10mM; Igepal, 0.1%; 
ATP, 0.01mM; DTT, 1mM) with PEG 6000 (15%) and 2u of Pfu DNA ligase. The 
thermal profile of the reaction was: 95°C - 2min; ((65°C - 30sec, 74°C - 10sec) x6 
subcycles) x20 cycles. 
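
The nested structure of this thermal profile can be written out explicitly. The sketch below is illustrative only: the names are hypothetical, ramp times are ignored, and because the notation does not state unambiguously whether the 95°C step is performed once or in every cycle, both readings are supported through the denature_each_cycle parameter (an assumption of this sketch).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int
    seconds: int


DENATURE = Step(95, 120)                  # 95°C - 2min
SUBCYCLE = (Step(65, 30), Step(74, 10))   # 65°C - 30sec, 74°C - 10sec


def ligation_program(cycles: int = 20, subcycles: int = 6,
                     denature_each_cycle: bool = False) -> list[Step]:
    """Expand the cyclic ligation profile into a flat list of hold steps."""
    steps: list[Step] = []
    if not denature_each_cycle:
        steps.append(DENATURE)
    for _ in range(cycles):
        if denature_each_cycle:
            steps.append(DENATURE)
        for _ in range(subcycles):
            steps.extend(SUBCYCLE)
    return steps


def hold_seconds(steps: list[Step]) -> int:
    """Total hold time, excluding temperature ramps."""
    return sum(step.seconds for step in steps)


print(hold_seconds(ligation_program()) / 60)
print(hold_seconds(ligation_program(denature_each_cycle=True)) / 60)
```

Under these assumptions the hold time is 82 minutes with a single initial denaturation and 120 minutes if denaturation is repeated in each of the 20 cycles.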

15µl of PCR mix (1x TaqMan Buffer, 400nM primers #PCR_L and #PCR_R, 300nM 
TaqMan probes #fam and #tet, and 0.5u AmpliTaq Gold polymerase) was added to 
5µl of ligation reaction. The TaqMan assay with end-point fluorescent detection was 
carried out on an ABI 7900HT system (Applied Biosystems, USA): 95°C - 10min, 
(95°C - 15sec, 60°C - 30sec) x40 cycles. 



The multilocus SNP detection. 



The overview of the procedure is shown in Figure 7. 30 locus-specific DO-sets were 
ligated simultaneously on genomic DNA (5fmol of each DO). 

Preamplification of the ligated products (PCR I) was carried out with common primers 
#PCR_R and #PCR_L. 15µl of PCR mix (1x TaqMan Buffer, 400nM primers #PCR_R 
and #PCR_L, and 0.25u AmpliTaq Gold polymerase) was added to 5µl of ligation 
reaction and the mixture was amplified in a thermocycler: 95°C - 10min, (95°C - 15sec, 
60°C - 30sec) x10 cycles and then diluted with 180µl of water. 

1µl of the diluted PCR I was taken as a template for the locus-specific PCR, 
performed in 10µl of 1x TaqMan Buffer with 400nM primers #PCR_L and one of the 
locus-specific primers (#PCR_01r, ... , #PCR_30r), 300nM TaqMan probes #fam and 
#tet, and 0.25u AmpliTaq Gold polymerase. The TaqMan assay with end-point 
fluorescent detection was carried out on an ABI 7900HT system (Applied 
Biosystems, USA): 95°C - 10min, (95°C - 15sec, 60°C - 30sec) x40 cycles. 
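
The volumes given above determine how much of the preamplified material reaches each locus-specific reaction. The following sketch works through that bookkeeping; the variable names are illustrative, and the assumption of perfect doubling in each of the 10 preamplification cycles is an idealisation that real reactions will not fully reach.

```python
from fractions import Fraction

LIGATION_UL = 5      # ligation reaction
PCR_MIX_UL = 15      # PCR mix added for PCR I
WATER_UL = 180       # dilution after PCR I
TEMPLATE_UL = 1      # aliquot used per locus-specific PCR
PREAMP_CYCLES = 10

total_ul = LIGATION_UL + PCR_MIX_UL + WATER_UL      # volume of diluted PCR I
fraction_taken = Fraction(TEMPLATE_UL, total_ul)    # share of PCR I per aliquot
ideal_gain = 2 ** PREAMP_CYCLES                     # assumes perfect doubling
net_vs_ligation = ideal_gain * fraction_taken       # copies relative to the whole ligation

print(total_ul, fraction_taken, ideal_gain, float(net_vs_ligation))
```

Each 1µl aliquot is 1/200 of the diluted PCR I; with ideal amplification it would still carry about five times as many ligated-product copies as the entire original ligation reaction, which illustrates why the preamplification step offsets the loss from splitting the sample.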

Determination of minimal concentration of DOs. 


Genotyping on 50ng of Ler DNA was performed with 13, 67, 130, 670 and 1300 pM 
locus "25" DOs. All parameters of ligation and subsequent amplification were as 
described in the protocol for detection of a single SNP locus, except for a shorter annealing 
time in cyclic ligation: (95°C - 2min; 55°C - 20sec) x20 cycles. 



EXAMPLES 

The procedures of examples 1 and 2 are outlined schematically in figures 6 and 7. In 
both cases the first step is a ligation detection reaction (LDR) converting the biallelic 
state of SNP locus into the bimarker state of ligated detector oligonucleotides (DOs). 
The state of the markers (Region II and II') is determined by a TaqMan assay during 
subsequent PCR amplification of ligated DOs. 

The same DOs are used both for single and multiplex ligation (Figure 8B). Three DOs 
are required per each biallelic locus: a common "right" DO (DOR) and two different 
"left" DOs (DOL1 and DOL2). 

The only locus-specific parts of DOs are Regions III and IV, which target DOs to 
a particular SNP position. Regions III are allele-specific: they have different 3'-terminal 
nucleotides corresponding to the SNP position. 

The other parts of the DOs (Regions I, II, II' and V) are the same for the different DO- 
sets. They are used for amplification combined with the TaqMan assay. Regions I 
and V are used in PCR amplification. The sequence of Region I is identical to the 
primer #PCR_L. The sequence of Region V is complementary to the primer #PCR_R. 
Regions II and II' (shown black and white in Figure 8B) are targets for universal 
TaqMan probes (#fam and #tet in Figure 8C). Region II and II' variants are arbitrarily 
assigned to SNP alleles. In both examples the "fam"-sequence is assigned to 
Columbia (Col-0) alleles and the "tet"-sequence to C24 or Landsberg (Ler) alleles. 
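
The block structure described above can be illustrated by assembling a DO-set from its regions. This is a sketch under several assumptions: the sequences used for Regions I, II, II' and V are hypothetical placeholders (the real common sequences are those listed in Table 2), the lengths chosen for Regions III and IV are arbitrary rather than selected by melting temperature, the assignment of Region II to the Col-0 allele is arbitrary, and strand orientation is simplified by writing the DOs in the same sense as the genomic sequence shown in Table 1.

```python
# Hypothetical placeholder sequences; the real common regions are given in Table 2.
REGION_I = "AAAAACCCCCGGGGGTTTTT"          # PCR priming site of the "left" DOs
REGION_II = "ACACACACACGTGTGTGTGT"         # allele marker assigned to allele 1
REGION_II_PRIME = "TCTCTCTCTCGAGAGAGAGA"   # allele marker assigned to allele 2
REGION_V = "TTTTTGGGGGCCCCCAAAAA"          # PCR priming site of the "right" DO


def build_do_set(upstream, allele_1, allele_2, downstream,
                 len_iii=20, len_iv=22, region_vi=""):
    """Return (left DO 1, left DO 2, right DO), each written 5' to 3'.

    Region III is the last len_iii - 1 upstream bases plus the allele base,
    so the SNP sits at the extreme 3' end of each "left" DO. The "right" DO
    starts immediately downstream of the SNP and would be 5'-phosphorylated.
    """
    flank = upstream[-(len_iii - 1):]
    left_1 = REGION_I + REGION_II + flank + allele_1
    left_2 = REGION_I + REGION_II_PRIME + flank + allele_2
    right = downstream[:len_iv] + region_vi + REGION_V
    return left_1, left_2, right


# Locus MASC03692 from Table 1: C in Col-0, A in C24.
UPSTREAM = "CCCCAAAGTTCGTAGATGAAAATGGTAACA"
DOWNSTREAM = "GAGGACTTAAAACAAGAAAGTTCCTCGCAA"
left_col0, left_c24, right_do = build_do_set(UPSTREAM, "C", "A", DOWNSTREAM)

print(left_col0)
print(left_c24)
print(right_do)
```

The two "left" DOs differ only in the allele marker (Region II or II') and in the 3'-terminal nucleotide, while the "right" DO is shared by both alleles.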

The procedure requires rather long DOs (65-80nt for "left" DOs and 50-60nt for "right" 
DOs). However, the distinct block structure, i.e. combination of constant and variable 
parts, allows their cost-effective generation by ligation-based synthesis (LBS) 
(Borodina et al. 2003). The alignment of the oligonucleotides involved in LBS of DO- 
set for the locus "01" is shown in Figure 8C. The LBS procedure utilizes T4 DNA 
ligase to join the ends of the presynthesized oligonucleotides: #L1 and #L1_01; #L2 
and #L2_01; #R and #R_01. Since T4 DNA ligase requires a double-stranded DNA 
substrate, these pairs are annealed to adapter oligonucleotides #aL1, #aL2 and #aR 
respectively. Locus-specific oligonucleotides #L1_01, #L2_01 and #R_01 have 5 nt 
overhangs complementary to adapters (shown bold in Table 2). 

Example 1: Detection of a single SNP locus 

Detection of a single SNP locus (Figure 6) is a two-step reaction with end-point 
fluorescent detection. The first step is the ligation detection performed on the 
genomic DNA. Depending on the allelic state of SNP locus one of the "left" DOs is 
ligated to the "right" DO, forming an amplicon with universal 3'- and 5'-ends. The 
TaqMan assay is performed in the same tube after addition of the amplification 
mixture (buffer with Taq polymerase and primers). The same amplification primers 
and TaqMan probes are used for all loci, making the technique cost efficient and 
convenient. The closed-tube fluorescent detection prevents cross-contamination. 

28 SNP loci polymorphic between A. thaliana accessions Col-0 and C24 and 2 loci 
polymorphic between accessions Col-0 and Ler were genotyped. Reproducible and 
reliable results were obtained for all 30 loci tested. 

End-point fluorescent signals for 15 independent genotypings of loci "18" and "20" 
performed on different amounts of genomic DNA (5ng - 150ng) are presented in 
Table 3 and in Figure 9A. In this graph, the signals for the different types of DNA 
(homozygous Col-0, C24 and heterozygous Col-0/C24) are located in compact areas 
and well separated from each other (Fig. 9A). 

The signals for genotyping of all loci on 50ng of DNA are plotted together in Figure 
9B (Table 4). Even taken together the signals for different loci are clearly grouped in 
three areas. As expected, the resulting dots for heterozygous DNA are more 
dispersed than dots for homozygous DNAs. However, Figure 9A demonstrates that 
the location of dots for each particular locus is reproducible, which is essential for 
automated scoring. 
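
Because the three genotype classes occupy separate areas of the Fam/Tet scatter plot, automated scoring can in principle be reduced to classifying each point by its position. The following is a minimal sketch of one possible approach and not the scoring procedure used by the inventors: the function name, the angle thresholds and the minimum-signal cut-off are hypothetical values that would have to be calibrated on real data. It follows the assignment of the "fam" sequence to Col-0 and the "tet" sequence to C24 and assumes non-negative signals.

```python
import math


def call_genotype(fam: float, tet: float, min_signal: float = 1.0,
                  low_deg: float = 30.0, high_deg: float = 60.0) -> str:
    """Classify one end-point measurement by its angle in the Fam/Tet plane.

    All thresholds are illustrative assumptions, not values from the text.
    """
    if math.hypot(fam, tet) < min_signal:
        return "no call"
    angle = math.degrees(math.atan2(tet, fam))  # 0 deg = pure Fam, 90 deg = pure Tet
    if angle < low_deg:
        return "Col-0"
    if angle > high_deg:
        return "C24"
    return "heterozygous"


for fam, tet in [(5.0, 0.5), (0.4, 6.0), (3.0, 3.0), (0.1, 0.1)]:
    print(fam, tet, call_genotype(fam, tet))
```

Per-locus thresholds may be preferable in practice, since the text notes that the dots for each particular locus are reproducible while heterozygous signals are more dispersed between loci.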

Example 2: Multilocus SNP detection 

The overview of the multilocus SNP detection protocol is shown in Figure 7. LDR is 
performed in one tube simultaneously for a number of loci. The allelic state of each 
particular locus is determined in separate TaqMan reactions with one common and 
one locus-specific PCR primer (step 3 in Figure 7). Locus-specific primers 
(#PCR_01r, ... , #PCR_nr) are used to distinguish a single amplicon from a complex 
mixture. An intermediate preamplification step 2 is introduced into the protocol to 
prevent decrease of sensitivity due to the distribution of the ligation mixture into a 
number of separate tubes for the locus-specific TaqMan assay. 

The procedure was tested for the parallel genotyping of the same 30 SNP loci 
described in Example 1. The end-point fluorescent signals for loci "10", "18", "20" and 
"29" obtained in fifteen independent genotypings on different amounts of genomic 
DNA (5ng - 150ng) are shown in Table 5 and Figure 10A. For all loci signals for 
different types of DNA (homozygous Col-0, C24 and heterozygous Col-0/C24) are 
well separated and are tightly grouped for >17ng of genomic DNA. 

The fluorescent signals for genotyping of all loci together on 50ng of genomic DNA 
are shown in Table 6 and are plotted in Figure 10B. Again, areas of different 
genotypes are reliably separated from each other. 

Some DO-sets may be incompatible with others in a parallel analysis, since some 
particular DO may serve as a template for misligation of other DOs. Among 30 loci 
tested only one locus (number "15") showed such a problem. Though the DO-set for 
locus "15" works excellently in the "single locus genotyping" (Table 4), it gives a 
"Fam/Tet" ratio of about 4 in the multilocus reaction on heterozygous DNA (Table 6). 

The main point of the parallel analysis is that in this format considerably less genomic 
DNA is required for genotyping of each particular locus: 17ng DNA for genotyping of 
30 loci means 0.6ng per locus, and we have clearly not reached the limit of 
multiplexing in these tests. 

Example 3. Minimal Concentration of DOs 

To estimate the minimal possible concentration of DOs in cyclic hybridization/ligation 
reaction, genotyping on 50ng of Landsberg Arabidopsis DNA was performed with 13 
- 1300 pM DOs (set "25"). In these experiments the time of hybridization/ligation was 
20 sec. All other parameters of ligation and subsequent amplification were as 
described in Example 1. Ligation detection was performed in the presence of various 
amounts of linear DOs (13 - 1300pM) in 5µl of 1x Pfu ligase buffer (Tris-HCl, pH 7.5, 
20mM; KCl, 20mM; MgCl2, 10mM; Igepal, 0.1%; ATP, 0.01mM; DTT, 1mM) with 15% 
PEG 6000 and 2u of Pfu DNA ligase. The thermal profile of the reaction was as 
follows: (95°C - 2min; 55°C - 20sec) x20 cycles. The results are shown in Figure 
11. Concentrations of DOs >130pM are enough for quantitative hybridisation in 20 
sec. When 13 - 130pM concentrations of DOs are used, the signal decreases, but is 
still sufficient for reliable detection. 

Comparison of real-time amplification profiles of (i) material obtained in 20 cycles of 
ligation on 100ng of Arabidopsis genomic DNA with DO-set "25" and (ii) a known 
amount of ligated DOs (40 amol) showed that practically all templates are used in 
each cycle (Figure 12). The theoretically possible amount after ligation is 88amol 
(22 amol/1 µg x 100 ng x 20 cycles x 2 alleles). The known amount of the ligated 
product was prepared by ligation of "left" DO2_25 and "right" DO_25 (2 fmol each) on 
1.3 fmol of template #tmpl_25 for 20 cycles of (94°C - 10sec; 55°C - 3min). The reaction 
was performed in 5µl of 1x Tth ligase buffer (Tris-HCl 20mM, pH 8.3; MgCl2 10mM; 
KCl 50mM; EDTA 1mM; NAD+ 1mM; DTT 10mM; Triton X-100 0.1%) in the presence 
of 50ng D. melanogaster genomic DNA. 
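
The theoretical maximum quoted above follows directly from the four factors given in the text. The sketch below simply reproduces that arithmetic; the function and variable names are illustrative assumptions, and the figure of 22 amol of template per µg of genomic DNA is taken from the text as given.

```python
TEMPLATE_AMOL_PER_UG = 22   # template per µg of genomic DNA, as quoted in the text
DNA_NG = 100                # genomic DNA in the ligation reaction
CYCLES = 20                 # ligation cycles
ALLELES = 2                 # factor for the two alleles, as in the text


def max_ligated_amol(amol_per_ug: int, dna_ng: int, cycles: int, alleles: int) -> float:
    """Upper bound on ligated product if every template is used once per cycle."""
    # Dividing by 1000 converts ng to µg; the integer product keeps the result exact.
    return amol_per_ug * dna_ng * cycles * alleles / 1000


theoretical = max_ligated_amol(TEMPLATE_AMOL_PER_UG, DNA_NG, CYCLES, ALLELES)
print(theoretical)
```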

Longer hybridisation time results in a decrease of the minimal concentration of DOs. 
Two hours of hybridisation followed by 5min ligation was performed at constant 
temperature (without subcycles). About the same signals were obtained for 
concentrations of DOs >20pM (without PEG) and >2pM (with 15% PEG 6000) 
(Figure 13). When a 2 - 20pM concentration of DOs is used (without PEG) the signal 
decreases, but is still sufficient for reliable detection (Table 7). Hybridisation was 
performed in the presence of 2-200pM of linear DOs in 5µl of 1x Pfu ligase buffer 
(Tris-HCl, pH 7.5, 20mM; KCl, 20mM; MgCl2, 10mM; Igepal, 0.1%; ATP, 0.01mM; 
DTT, 1mM) with (or without) 15% PEG 6000. The thermal profile of the reaction was as 
follows: (95°C - 2min; 55°C - 2hours). Then 2u of Pfu DNA ligase were added and 
the mixture was incubated for 5 min at 55°C. Real-time detection with SybrGreen II was 
performed on ABI 9700HT system. 

Another conclusion from Figure 13 is that 15% PEG 6000 lowers the minimal
concentration of detector oligonucleotides about tenfold. The same effect was
observed for shorter hybridization times.

Tables.

Table 1 



Genomic sequences of SNP loci analysed in Examples 1-3 (30 nt on both sides of the
SNP position). Sequences of loci "25" and "26" are from (Cho et al. 1999).

Locus  Chr       MASC Number  Genomic sequences

Chromosome 1

01     0.85_M    MASC03692    CCCCAAAGTTCGTAGATGAAAATGGTAACA C GAGGACTTAAAACAAGAAAGTTCCTCGCAA (Col-0)
                              CCCCAAAGTTCGTAGATGAAAATGGTAACA A GAGGACTTAAAACAAGAAAGTTCCTCGCAA (C24)

02     2.04_M    MASC06796    ATAGTTTGGTCTTTTATATATCACAGATTG T AGACAGGCATTACTCAAAGTGAAGCCATGT (Col-0)
                              ATAGTTTGGTCTTTTATATATCACAGATTG C AGACAGGCATTACTCAAAGTGAAGCCATGT (C24)

03     2.69_M    MASC02113    TTTTCGAAAATTACACCAGATTCACCAAAC G CGACGCGCCTCTTCCTTCTTGGTCTTCCTC (Col-0)
                              TTTTCGAAAATTACACCAGATTCACCAAAC C CGACGCGCCTCTTCCTTCTTGGTCTTCCTC (C24)

04     4.10_M    MASC07144    CAGAAGCAAGAGACAAAGTCATGGACGAAG G AGACACACATTTCATGTTGGTTTGTAGTAT (Col-0)
                              NNNNNGCAAGAGACAAAGTCATNGACGAAG A AGACACACATTTCATGTTGGTTTGTAGTAT (C24)

05     5.40_M    MASC00817    AGTTGAGACCCTCTCTGGTGTCCTCGCCGG A TCTCTTGTATCCGGTGTTCAGGTAAAGTAT (Col-0)
                              AGTTGAGACCCTCTCTGGTGTCCTCGCCGG C TCTCTTGTATCCGGTGTTCAGATCGCCATA (C24)

06     6.82_M    MASC06752    ACCTGGTTCCACCGACGCTTACAGTTATCC T CTAAAGAGTTGCAAATACATATTCATGCTC (Col-0)
                              ACCTGGTTCCACCGACGCTTACAGTTATCC A CTAAAGAGTTGCAAATACATATTCATGCTC (C24)

07     7.83_M    MASC02593    TTTAGAAGGTAAAGACAAATGTGCTCTTGA T GGGAAACAAACCATCAAATCTGCTGAGTAA (Col-0)
                              TTTAGAAGGTAAAGACAAATGTGCTCTTGA A GGGAAACAAACCATCAAATCTGCTGAGTAA (C24)

08     8.69_M    MASC07163    AAGCCAAAGGTTAGGACAAAGGTAGCAACA A ATCAGTATTCAGAATTGGAGATGACATCGT (Col-0)
                              AAGCCAAAGGGTAGNANAATGGTAGCAACA C ATCAGTATTCAGAANNNNAGATGACATCGT (C24)

09     9.84_M    MASC06722    [illegible in source] (Col-0)
                              TATTTCAGAGTTGTGTAGTGTAGTTGCAAG C GGCGGTTGGATCTTTCACTATCTTGTTGGA (C24)

10     10.70_M   MASC02175    TCGAAGACCATTGAACTGAGGCAATGTAGT C ATCATAGTGCTAGCCATTTTCTTAGTTGAT (Col-0)
                              TCGAAGACCATTGAACTGAGGCAATGTAGT T ATCATAGTGCTAGCCATTTTCTTAGTTGAT (C24)

11     11.91_M   MASC04228    TTAAAAATGGGATAACATTTTAGTGCATAA C AAAGTTTTAGACTTAAGGTTGAAAAAACGA (Col-0)
                              TTAAAAATGGGATAACATTTTAGTGCATAA G AAAGTTTTAGACTTAAGGTTGAAAAAACGA (C24)

12     13.07_M   MASC00364    GACGACTGATTCCGCGAATTTCGGAGCGAC G GTGAATGCGGTTCGAGACGTCGACGCAGTG (Col-0)
                              GACGACTGATTCAGCGAATTTCGGAGCGAC A GTGAATGCGGTTCGAGACGTCGACGCAGTG (C24)

13     15.37_M   MASC04127    GAGCATCTTACAGCAGCTCAAACTTCAAAT A TCGTAGTGCTGAGCATTAAGTACCTGCGAT (Col-0)
                              GAGCATCTTACAGCAGCTCAAACTTCAAAT G TCGTAGTGCTGAGCATTAAGTACCTGCGAT (C24)

14     16.35_M   MASC06782    TTGAGAGATACCTTGTTCTTTGAGACCGAG G GCAGATCGGACGTTTTTGCCGGGAGGAATG (Col-0)
                              AAACAATGGACCTTGTTCTTTGAGACCGAG A GCAGATCGGACGTTTTTGCCGGGAGGAATG (C24)

15     17.66_M   MASC05011    GACAAACAAGAGAAGCGGAGAAAGTGAGAC G GTAAGGATCTCTCCGGTGTCGGCTTTTGTT (Col-0)
                              GACAAACAAGAGAAGCGGAGAAAGTGANAC A GTAAGGATCTCTCCGGTGTCGGCTNNNNNN (C24)

16     18.72_M   MASC06707    CCTACCGCCAAGATATGCCGTAACTGAGTT T GCTATTTAACCAGCAACTGTATCTATGTCG (Col-0)
                              CCTACCGCCAGGATATGCCGTAACTGAGTT C GCTATTTAACCAGCAACTGTATCTATGTCG (C24)

17     19.76_M   MASC00330    TCCTCCCACCTCATCAAGCTTCGTCAGGAA T GTCCACTCGGATGTCACCGTATCGTAGGAG (Col-0)
                              TCCTCCCACCTCATCAAGCTTCGTCAGGAA C GTCCACTCGGATGTCACCGTATCGTAGGAG (C24)

18     20.75_M   MASC07052    GTCGTGATGTCACTTAACCATCATCAACCC T TCGAATTAGTTCCTAGAAACCTATTATATC (Col-0)
                              GNCGTNGTGTCACTNNNNNNTCATCAACCC A TCGAATTAGTNNMNNNNWNNNTNNNNNNNN (C24)




WO 2005/026389 






PCT/EP2004/009546 



Locus  Chr       MASC Number  Genomic sequences

19     21.93_M   MASC04194    TGATTTCTAAGCCACTGTGAGTGCCCTTGA C TTGATTATATATTCGGATTGTGTTGTTTTC (Col-0)
                              TGATTTCTAAGCCACTGTGAGTGCCCTTGA T TTGATTATATATTCGGATTGTGTTGTTTTC (C24)

20     23.17_M   MASC02688    AGCAGAACGAAGAGAGTAACGAACGATGAC T GGTTTACTTTCAAATTCAGAGAAGAAGGAA (Col-0)
                              AGCAGAACGAAGAGAGTAACGAACGATGAC C GGTTTACTTTCAAATTCAGAGAAGAAGGAA (C24)

21     25.41_M   MASC06750    CGAAAGATGGTGAGTTTTTGAGACACAACT A GGATATTTCTTAATGGTTTGTGCCACTAAT (Col-0)
                              CGAAAGATGGTGAGTTTTTGAGACACAACT G GGATATTTCTTAATGGTTTGTTGCCACTAA (C24)

22     26.79_M   MASC00629    GTACCAAGCTCTCAAGATGTCTCTCGGTGT A TTCACCGCTTTCAGCATCGGTGTTGGGGTT (Col-0)
                              GTACCAAGCTCTCAAGATGTCTCTCGGTGT C TTCACCGCTTTCAGCATCGGTGTTGGGGTT (C24)

23     28.02_M   MASC07087    AGAGTCAATCCATGTTATTGTTCTTGGTTT C TCTGCTGCTGTCATTACAGACGTTGATATC (Col-0)
                              NNANNCAATCCATGTTATTGTTCTTGGTTT T TCTGCTNNNGTCATTACAGACGTTGATATC (C24)

24     29.09_M   MASC07132    ATCGGCTTGGGCTTACACGGTGGCGCAATT T AGCTGGCCGTTCCAGTGGAGTCAGCGGGTC (Col-0)
                              ATCGGCTTGGGCTTACACGGTGGCGCAATT G AGCTGGCCGTTCCAGTGGAGTCAGCGGGTC (C24)

Chromosome 4

25     12.0_M    SNP149       AAGCAAGCGGCGAACATTTAAGACCAAACT G TTGTAAAACGTCGGGAGAAACAGAGAGCGA (Col-0)
                              AAGCAAGCGGCGAACATTTAAGACCAAACT A TTGTAAAACGTCGGGAGAAACAGAGAGCGA (Ler)

26     16.2_M    SNP200       ATACGACGGAGCTAAATCGAGATCGTTAAA C CCTCCGAAGACTTCACGGTAGTCAAAACCT (Col-0)
                              ATACGACGGAGCTAAATCGAGATCGTTAAA A CCTCCGAAGACTTCACGGTAGTCAAAACCT (Ler)

27     0.03_M    MASC02820    AGTTGTGGTAATAACAAAGGTAAGAGGGGT T AGCCATAGGAGCTCCAATCTGAGAGTGAGG (Col-0)
                              AGTTGTGGTAATAACAAAGGTAAGAGGGGT C AGCCATAGGAGCTCCAATCTGAGAGTGAGG (C24)

28     0.5_M     MASC02948    TATCTTCAACATCACATCTGTTATTTCCTC G ATAGTCGCCTTTCGGTTCTCCTCATAGTTC (Col-0)
                              TATCTTCAACATCACATCTGTTATTTCCTC C ATAGTCGCCTTTCGGTTCTNNNNATAGTTC (C24)

29     4.15_M    MASC04672    TATGCACCATCTATCTCTGTTTTCAGAATA T ATCTCCAGCTATCAATTAAAACCCCAAAAA (Col-0)
                              TATGCACCATCTATCTCTGTTTTCAGAATA C ATCTCCAGCTATCAATTAAAACCCCAAAAA (C24)

30     5.1_M     MASC02658    TTGCGGTGGAACAGCAAAATGTTTCTAGTC G ACAACCTTGTTCAGATGTTTCGCTTCTTCT (Col-0)
                              TTGCGGTGGAACAGCAAAATGTNTCTAGTC C ACAACCTTGTTCAGATGTTTCGCTTCTTCT (C24)
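
Each entry of Table 1 lists 30 nt upstream of the SNP, the polymorphic base, and 30 nt downstream, followed by the accession in parentheses. The following minimal sketch splits such an entry into its parts, using the locus "01" sequences from the table; the helper name `parse_locus` is an assumption made for illustration.

```python
def parse_locus(row):
    """Split 'UPSTREAM SNP DOWNSTREAM (accession)' into its four parts."""
    seq, _, accession = row.rpartition("(")
    upstream, snp, downstream = seq.split()
    return upstream, snp, downstream, accession.rstrip(") ")

col0 = parse_locus(
    "CCCCAAAGTTCGTAGATGAAAATGGTAACA C GAGGACTTAAAACAAGAAAGTTCCTCGCAA (Col-0)"
)
c24 = parse_locus(
    "CCCCAAAGTTCGTAGATGAAAATGGTAACA A GAGGACTTAAAACAAGAAAGTTCCTCGCAA (C24)"
)

# Both flanks are 30 nt long and the two accessions differ only at the SNP.
print(len(col0[0]), col0[1], len(col0[2]), col0[3])
print(len(c24[0]), c24[1], len(c24[2]), c24[3])
```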

Table 2

Sequences of oligonucleotides used in Examples 1-3 (5' → 3' oriented). 5'- and 3'-
end pentanucleotides for preparation of DOs by the ligation-based method are marked
bold. Alignment of locus "01" DO-set is shown in Figure 8. Genomic sequences in the
vicinity of SNPs are shown in Table 1. DOR_25 and DOR_26 were prepared by
conventional synthesis, so they do not contain overhangs for ligation.



Name      Length  Sequence (5' → 3')

Locus-specific primers

#L1 01    28      ACGGGAACTTTCTTGTTTTAAGTCCTCG
#L2 01    28      ACGGGAACTTTCTTGTTTTAAGTCCTCT
#R 01     30      TGTTACCATTTTCATCTACGAACTTCGGTC
#PCR 01r  27      CGAAGTTCGTAGATGAAAATGGTAACA

#L1 02    29      ACGGGTTCACTTTGAGTAATGCCTGTCTA
#L2 02    29      ACGGGTTCACTTTGAGTAATGCCTGTCTG
#R 02     35      CAATCTGTGATATATAAAAGACCAAACTATCGGTC
#PCR 02r  30      GACCGATAGTTTGGTCTTTTATATATCACA

#L1 03    28      ACGGGAATTACACCAGATTCACCAAACG
#L2 03    28      ACGGGAATTACACCAGATTCACCAAACC
#R 03     22      CGACGCGCCTCTTCCTTCGGTC
#PCR 03r  23      TATCCCGTTAGACCGAAGGAAGA

#L1 04    26      ACGGGAGACAAAGTCATGGACGAAGG
#L2 04    26      ACGGGAGACAAAGTCATGGACGAAGA
#R 04     30      AGACACACATTTCATGTTGGTTTGTCGGTC
#PCR 04r  25      ACAAACCAACATGAAATGTGTGTCT

#L1 05    28      ACGGGCCTGAACACCGGATACAAGAGAT
#L2 05    28      ACGGGTCTGAACACCGGATACAAGAGAG
#R 05     23      CCGGCGAGGACACCAGAGCGGTC
#PCR 05r  21      GTTAGACCGCTCTGGTGTCCT

#L1 06    27      ACGGGCACCGACGCTTACAGTTATCCT
#L2 06            ACGGGCACCGACGCTTACAGTTATCCA
                  CTAAAGAGTTGCAAATACATATTCATGCTCGGTC
                  GAGCATGAATATGTATTTGCAACTCTTTAG

                  ACGGGTAAAGACAAATGTGCTCTTGAT
                  ACGGGTAAAGACAAATGTGCTCTTGAA
                  GGGAAACAAACCATCAAATCTGCTCGGTC
                  GCAGATTTGATGGTTTGTTTCCC

#L1 08            ACGGGTTAGGACAAAGGTAGCAACAA
                  ACGGGGTAGGACAATGGTAGCAACAC
                  ATCAGTATTCAGAATTGGAGATGACATCGGTC
                  GATGTCATCTCCAATTCTGAATACTGAT

                  ACGGGTGAAAGATCCAACCGCCA
                  ACGGGTGAAAGATCCAACCGCCG
                  CTTGCAACTACACTACACAACTCTGAAATACGGTC
                  TTAGACCGTATTTCAGAGTTGTGTAGTGTA

#L1 10            ACGGGCATTGAACTGAGGCAATGTAGTC
                  ACGGGCATTGAACTGAGGCAATGTAGTT
                  ATCATAGTGCTAGCCATTTTCTTAGTTGATCGGTC
                  TCAACTAAGAAAATGGCTAGCACTATGAT

                  CCGTTAGACCGATGCTGAAAG

#L1 23            ACGGGAATCCATGTTATTGTTCTTGGTTTC
                  ACGGGAATCCATGTTATTGTTCTTGGTTTT
#R 23             TCTGCTGCTGTCATTACAGACGTTCGGTC
                  CCGAACGTCTGTAATGACAGCA

#L1 24            ACGGGACACGGTGGCGCAATTT
#L2 24    22      ACGGGACACGGTGGCGCAATTG
#R 24             AGCTGGCCGTTCCAGTGGCGGTC
                  CCACTGGAACGGCCAGCT

                  ACGGGCGGCGAACATTTAAGACCAAACTG
                  ACGGGCGGCGAACATTTAAGACCAAACTA
                  TTGTAAAACGTCGGGAGAAACAGAGAGGTCTAACGGGATAGCGTG
          26      TCTCTGTTTCTCCCGACGTTTTACAA
#tmpl 25          CTCTCTGTTTCTCCCGACGTTTTACAACAGTTTGGTCTTAAATGTTCGCCGC

                  ACGGGAGCTAAATCGAGATCGTTAAAC
                  ACGGGAGCTAAATCGAGATCGTTAAAA
DOR 26            CCTCCGAAGACTTCACGGTAGTCAGTCTAACGGGATAGCGTG
          23      CGGTTTTGACTACCGTGAAGTCT

#L1 27            ACGGGAGATTGGAGCTCCTATGGCTA
#L2 27            ACGGGAGATTGGAGCTCCTATGGCTG
#R 27             ACCCCTCTTACCTTTGTTATTACCACACGGTC
          27      TGTGGTAATAACAAAGGTAAGAGGGGT

#L1 28    23      ACGGGAACCGAAAGGCGACTATC
#L2 28    23      ACGGGAACCGAAAGGCGACTATG
#R 28             GAGGAAATAACAGATGTGATGTTGAAGATCGGTC
#PCR 28r  28      TCTTCAACATCACATCTGTTATTTCCTC

#L1 29    27      ACGGGTTTTAATTGATAGCTGGAGATA
#L2 29    27      ACGGGTTTTAATTGATAGCTGGAGATG
#R 29             TATTCTGAAAACAGAGATAGATGGTGCACGGTC
                  TGCACCATCTATCTCTGTTTTCAGAATA

#L1 30            ACGGGAACAGCAAAATGTTTCTAGTCG
#L2 30            ACGGGAACAGCAAAATGTTTCTAGTCC
#R 30     27      ACAACCTTGTTCAGATGTTTCGCTTCGGTC
#PCR 30r  22      CGAAGCGAAACATCTGAACAAG

Common oligonucleotides

#L1       42      TGCACAATTCACGACTCACGATCCACACGGTCTCGCACTGGC
#aL1      15      CCCGTGCCAGTGCGA
#L2       42      TGCACAATTCACGACTCACGATCATCCGCTCCGACGACACGA
#aL2      16      CCCGTTCGTGTCGTCG
#R        21      TAACGGGATAGCGTGGTGGTA
#aR       17      GCTATCCCGTTAGACCG
#PCR L    20      GCACAATTCACGACTCACGA
#PCR R    22      CCACCACGCTATCCCGTTAGAC
#fam      20      FAM-CCACACGGTCTCGCACTGGC-TAMRA
#tet      20      TET-CATCCGCTCCGACGACACGA-TAMRA
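
The relation between the locus-specific oligonucleotides of Table 2 and the genomic sequences of Table 1 can be checked for locus "01" with a short sketch. It assumes, based only on comparing the two tables, that for this locus the oligonucleotides are designed on the strand complementary to the one listed in Table 1, with a 5' ACGGG pentanucleotide on #L1/#L2 and a 3' CGGTC pentanucleotide on #R; other loci may use the opposite strand. The helper `revcomp` and the flank lengths (22 and 25 nt) are read off the sequences, not stated in the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

# Locus "01" from Table 1 (Col-0 has C at the SNP, C24 has A)
UPSTREAM = "CCCCAAAGTTCGTAGATGAAAATGGTAACA"
DOWNSTREAM = "GAGGACTTAAAACAAGAAAGTTCCTCGCAA"

# Locus "01" oligonucleotides from Table 2
L1_01 = "ACGGGAACTTTCTTGTTTTAAGTCCTCG"
L2_01 = "ACGGGAACTTTCTTGTTTTAAGTCCTCT"
R_01 = "TGTTACCATTTTCATCTACGAACTTCGGTC"

# Allele-specific part: reverse complement of (SNP base + 22 nt downstream)
col0_part = revcomp("C" + DOWNSTREAM[:22])
c24_part = revcomp("A" + DOWNSTREAM[:22])
# Common part: reverse complement of the last 25 nt upstream of the SNP
common_part = revcomp(UPSTREAM[-25:])

print(L1_01 == "ACGGG" + col0_part)
print(L2_01 == "ACGGG" + c24_part)
print(R_01 == common_part + "CGGTC")
```

All three comparisons hold for locus "01", so the two allele-specific oligonucleotides differ only in their 3' terminal nucleotide, which pairs with the polymorphic base.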

Table 3

"Single locus" SNP-genotyping on different amounts of Arabidopsis genomic DNA 
(Example 1). Genotyping was performed either on homozygous (Col-0, C24) or 
heterozygous (Col-0/C24) DNA. Results from Table 4 are also included (marked by 
asterisk). 



Locus  DNA (ng)  Col-0          C24            Col-0/C24
                 Fam    Tet     Fam    Tet     Fam    Tet

18     150       2,65   0,14    0,43   2,57    2,20   1,12
       50        3,24   0,12    0,43   2,69    2,53   1,42
       50*       2,54   0,13    0,33   1,88    2,50   1,13
       17        3,24   0,12    0,48   2,80    2,46   1,15
       5         3,20   0,13    0,46   2,84    2,53   1,16

20     150       3,14   0,16    0,26   2,53    1,85   1,74
       50        3,20   0,23    0,47   3,06    2,22   2,01
       50*       2,68   0,17    0,34   2,32    2,34   2,12
       17        3,04   0,26    0,46   2,90    2,10   1,84
       5         3,02   0,35    0,44   2,90    2,13   2,80
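
One way to turn Fam/Tet signal pairs such as those in Table 3 into genotype calls is to look at the fraction of the total signal contributed by Fam. The sketch below is illustrative only: the function name and both cut-off values are assumptions chosen for this example, not thresholds given in the text.

```python
def call_genotype(fam, tet, hom_cutoff=0.8, het_low=0.25):
    """Classify a sample from its Fam and Tet end-point signals.

    hom_cutoff and het_low are hypothetical cut-offs on fam / (fam + tet).
    """
    fraction = fam / (fam + tet)
    if fraction >= hom_cutoff:
        return "homozygous Fam allele"
    if fraction <= het_low:
        return "homozygous Tet allele"
    return "heterozygous"

# Signal pairs taken from the 150 ng rows of Table 3
print(call_genotype(2.65, 0.14))  # Col-0 sample
print(call_genotype(0.43, 2.57))  # C24 sample
print(call_genotype(2.20, 1.12))  # Col-0/C24 sample
```

With these assumed cut-offs, the Col-0 samples in Table 3 fall into the Fam class, the C24 samples into the Tet class, and the Col-0/C24 samples in between.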

Table 4

"Single locus" SNP-genotyping on 50 ng of Arabidopsis genomic DNA (Example 1).
Genotyping was performed either on homozygous (Col-0, C24) or heterozygous
(Col-0/C24) DNA. "k" is control without DNA.

1,90

Table 5

SNP-genotyping on different amounts of Arabidopsis genomic DNA (Example 2).
Results from Table 6 are also included (marked by asterisk).

Table 6

SNP-genotyping on 50ng of Arabidopsis genomic DNA (Example 2). Genotyping was 
performed either on homozygous (Col-0, C24) or heterozygous (Col-0/C24) DNA. "k" 
is control without ligation (amplification with common PCR primers). 

Table 7

Influence of DO concentration on the fluorescent signal for a 2 hour hybridisation.

Claims

1. A method for the detection of the allelic state of n single nucleotide 
polymorphism (SNP)-loci on genomic DNA comprising the steps of 

(a) contacting said genomic DNA containing SNP-loci with n detector
oligonucleotide sets (DO-sets), wherein n represents the number of specific
loci to be tested, and wherein each DO-set represents a specific collection of

(aa) a first detector oligonucleotide (DO),

wherein said first DO comprises, in 5' to 3' orientation, a Region I comprising
a DNA sequence for PCR amplification; a Region II, or alternatively, II' for
allele discrimination and a Region III, for locus specific hybridization, wherein
said Region III comprises a DNA sequence which corresponds to the part of 
the sequence of the SNP locus located upstream of the polymorphic single 
nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the first allelic state 
or, alternatively, for the second allelic state of said SNP-locus; and 

(ab) a second DO, 

wherein said second DO comprises, in 5' to 3' orientation, a Region I'
corresponding in sequence to Region I of the first DO; a Region II', or
alternatively, Region II for allele discrimination, wherein, when Region II is
comprised in the first DO, the second DO must comprise Region II' or,
alternatively, when Region II' is comprised in the first DO, the second DO must
comprise Region II; and a Region III', wherein said Region III' comprises a
DNA sequence which corresponds to the part of the sequence of the SNP 
locus located upstream of the polymorphic single nucleotide site and which 
consists, at its 3' terminus, of a single nucleotide complementary to the single 
nucleotide specific for either the second allelic state or, alternatively, for the 
first allelic state of said SNP-locus, wherein, when said Region III is specific for 
said first allelic state, said Region III' is specific for said second allelic state or, 
alternatively, when said Region III is specific for said second allelic state, said 
Region III' is specific for said first allelic state of said SNP-locus; and 

(ac) a third DO, 
wherein said third DO is 5'-phosphorylated and comprises, in 5' to 3'
orientation, a Region IV for locus specific hybridization, wherein said Region IV
comprises a DNA sequence which corresponds in sequence to the part of the
sequence of the SNP locus located downstream of the polymorphic single
nucleotide site and which, at its 5' terminus is located immediately
downstream of the 3' terminus of the first or second DO; and a Region V
comprising a DNA sequence for PCR amplification; and, optionally, a Region
VI for locus discrimination, wherein said Region VI comprises a DNA
sequence specific for each DO-set and which is linked at its 5' terminus to the
3' terminus of Region IV and at its 3' terminus to the 5' terminus of Region V;
and

wherein said Regions I/I' and II/II' are invariable in sequence in all said first
and second detector oligonucleotides in all DO-sets and said Region V is
invariable in sequence in all said third detector oligonucleotides in all DO-sets,
and wherein said Region VI is not identical in sequence in the different DO-
sets;

(b) carrying out a cyclic ligation reaction simultaneously with n DO-sets, wherein
for each particular DO set of step (a) depending on the allelic state of the
corresponding locus, either the first DO or the second DO or both DOs are
linked to the third DO;

(c) carrying out a first PCR-amplification reaction on the ligated products with a
common first and a common second PCR primer, wherein said common first
PCR primer corresponds in sequence to Region I/I' of the first and second
detector oligonucleotide of step (a) and said common second PCR primer is
complementary in sequence to Region V of the third DO of step (a);

(d) separating the amplification products into n aliquots;

(e) carrying out for each aliquot

(ea) a second PCR-amplification reaction with the common first PCR primer of
step (c) and a second PCR primer which is either complementary to Region IV
or, optionally, to Region VI of the third DO of step (a), simultaneously or step-
by-step with

(eb) an allele discrimination reaction which detects Regions II and II' in the
amplification products.
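
The allele-dependent outcome of the ligation step can be illustrated with a minimal sketch. It assumes, for illustration only, that the first DO carries Region II and the second DO carries Region II', and that ligation to the third DO occurs only when the 3' terminal nucleotide of a DO pairs with the template base at the SNP site. The function and variable names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def ligation_products(first_do_3prime, second_do_3prime, template_bases):
    """Return the allele discrimination regions present in ligated products.

    template_bases: bases found at the SNP site on the strand the DOs
    hybridise to (one base for a homozygote, two for a heterozygote).
    """
    products = []
    for region, base in (("II", first_do_3prime), ("II'", second_do_3prime)):
        # A DO is joined to the third DO only if its 3' base pairs with the template.
        if any(COMPLEMENT[t] == base for t in template_bases):
            products.append(region)
    return products

# First DO ends in G, second DO ends in T (as for the locus "01" pair)
print(ligation_products("G", "T", {"C"}))       # first allelic state only
print(ligation_products("G", "T", {"A"}))       # second allelic state only
print(ligation_products("G", "T", {"C", "A"}))  # both allelic states
```

In the sketch, a homozygous template yields a product with only one of the two regions, while a heterozygous template yields both, which is what the allele discrimination reaction of step (eb) then detects.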

2. The method of claim 1, wherein only one SNP locus is tested and wherein the 
first PCR amplification reaction of step 1(c) is performed simultaneously or 
step by step with the allele discrimination reaction of step 1(eb) and wherein 
steps 1(d) and 1(ea) are omitted. 

3. The method of claims 1 or 2, wherein a first oligonucleotide labelled with a first
detectable label and able to hybridize with said Region II, and a second
oligonucleotide labelled with a second detectable label and able to hybridize
with said Region II' are used in the allele discrimination reaction of step 1(eb).

4. The method of claim 3 wherein said first and second detectable labels are two
distinguishable fluorescent labels and their fluorescence is suppressed by
quenchers located in close vicinity to said fluorescent labels and wherein the
presence of Region II in the amplification products in the allele discrimination
reaction of step 1(eb) induces activation of the first fluorescent label but does
not activate the second fluorescent label and the presence of Region II' in the
amplification products in the allele discrimination reaction of step 1(eb)
induces activation of the second fluorescent label, but does not activate the
first fluorescent label.

5. The method of any one of claims 1 to 4, wherein both said quencher and said 
first fluorescent label are covalently attached to the first oligonucleotide and 
wherein both said quencher and said second fluorescent label are covalently 
attached to the second oligonucleotide and spatial separation is performed by
Taq polymerase 5'-exonuclease digestion of the first oligonucleotide
hybridized to Region II or the second oligonucleotide hybridized to Region II'.

6. The method of any one of claims 1 to 4, wherein both said quencher and said 
first fluorescent label are covalently attached to the first oligonucleotide and 
wherein both said quencher and said second fluorescent label are covalently
attached to the second oligonucleotide and each of the first and second
oligonucleotide contains two short complementary regions and wherein said
first or second fluorescent label and said quencher are brought together when
said complementary regions are hybridised with each other, and wherein the
separation of said first or second fluorescent label from the quencher is due to
change of spatial conformation of the first or second labelled oligonucleotide
after hybridization with Region II or II'. 

7. The method of any one of claims 1 to 4, wherein said first and second 
oligonucleotides contain covalently attached fluorescent labels and said 
quenchers are attached to additional oligonucleotides, wherein said additional 
oligonucleotides are complementary to said first and second oligonucleotides 
and wherein said first or second fluorescent label and said quencher are 
brought together when said additional oligonucleotides are hybridised with 
said first and second oligonucleotides and wherein said quencher is separated 
from said fluorescent label when said first and second oligonucleotides 
hybridize with Regions II or II'. 

8. The method of any one of claims 1 to 3, wherein the allele discrimination 
reaction of step 1(eb) is performed with a microarray detection system
simultaneously or step by step with the first amplification reaction of step 1(c)
wherein individual elements of said microarray are able to hybridize with locus- 
specific Regions IV or, optionally, Regions VI of amplification products and 
wherein Regions II and II' in the amplification products are detected by 
hybridization with said first and second oligonucleotides labeled with two 
distinguishable fluorescent labels. 

9. The method of any one of claims 1 to 7, wherein said common second PCR 
primer is destroyed after the first PCR-amplification of step 1(c). 

10. The method of claim 9, wherein said common second PCR primer contains
Uridine- or Ribo-nucleotides and is destroyed by Uracil DNA Glycosylase
(UDG) or RNase included in said second PCR-amplification reaction of step
1(e).

11. The method of any one of claims 1 to 10, wherein said Regions I and I' have 
identical sequences. 

12. The method of any one of claims 1 to 11, wherein said first, second and third 
DOs are prepared by ligation based synthesis. 

13. The method of any one of claims 1 to 12, wherein the concentration of DOs in 
each DO set is in the range of 10pM to 10nM, preferably 1nM. 

14. The method of any one of claims 1 to 13, wherein only one cycle of ligation
reaction is performed in step 1(b), preferably more than 10 cycles, and most
preferably 15 to 25 cycles.

15. The method of any one of claims 1 to 14, wherein the cyclic ligation reaction of
step 1(b) is effected with thermostable DNA ligase, preferably Taq DNA ligase,
Tth DNA ligase, and most preferably with Pfu DNA ligase.

16. The method of any one of claims 1 to 15, wherein the cyclic ligation of step 
1(b) is effected in the presence of at least 1% polyethyleneglycol (PEG), 
preferably in the presence of at least 5% PEG and most preferably in the 
presence of about 15% PEG. 

17. The method of claim 16, wherein said polyethylene glycol has a molecular 
weight of more than 200, preferably more than 3000 and most preferably in the 
range 6000 - 8000. 

18. The method of any one of claims 1 to 17, wherein the cyclic ligation of step
1(b) is effected with at least 2 subcycles, preferably with 3 to 6 subcycles.

19. A detector oligonucleotide (DO) selected from the group consisting of 

(a) a first DO which comprises, in 5' to 3' orientation, a Region I comprising a
DNA sequence for PCR amplification; a Region II, or alternatively, II' for allele 
discrimination and a Region III, for locus specific hybridization, wherein said 
Region III comprises a DNA sequence which corresponds to the part of the 
sequence of the SNP locus located upstream of the polymorphic single 
nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the first allelic state 
or, alternatively, for the second allelic state of said SNP-locus; and 

(b) a second DO, which comprises, in 5' to 3' orientation, a Region I'
corresponding in sequence to Region I of the first DO; a Region II', or 
alternatively, Region II for allele discrimination, wherein, when Region II is 
comprised in the first DO, the second DO must comprise Region II' or, 
alternatively, when Region II' is comprised in the first DO, the second DO must 
comprise Region II; and a Region III', wherein said Region III' comprises a 
DNA sequence which corresponds to the part of the sequence of the SNP 
locus located upstream of the polymorphic single nucleotide site and which 
consists, at its 3' terminus, of a single nucleotide complementary to the single 
nucleotide specific for either the second allelic state or, alternatively, for the 
first allelic state of said SNP-locus, wherein, when said Region III is specific for 
said first allelic state, said Region III' is specific for said second allelic state or, 
alternatively, when said Region III is specific for said second allelic state, said
Region III' is specific for said first allelic state of said SNP-locus; and 

(c) a third DO, which is 5'-phosphorylated and comprises, in 5' to 3' 
orientation, a Region IV for locus specific hybridization, wherein said Region IV 
comprises a DNA sequence which corresponds in sequence to the part of the 
sequence of the SNP locus located downstream of the polymorphic single 
nucleotide site and which, at its 5' terminus is located immediately 
downstream of the 3' terminus of the first or second DO; and a Region V 
comprising a DNA sequence for PCR amplification; and, optionally, a Region 
VI for locus discrimination, wherein said Region VI comprises a DNA 
sequence specific for each DO-set and which is linked at its 5' terminus to the 
3' terminus of Region IV and at its 3' terminus to the 5' terminus of Region V;
and wherein said Regions I/I' and II/II' are invariable in sequence in all said
first and second detector oligonucleotides in all DO-sets and said Region V is
invariable in sequence in all said third detector oligonucleotides in all DO-sets,
and wherein said Region VI is not identical in sequence in the different DO-
sets.

20. The detector oligonucleotide of claim 19, wherein said detector oligonucleotide 
comprises a nucleotide sequence as set forth in SEQ ID NOs 1 to 129. 

21. Use of the detector oligonucleotide of claims 19 or 20 for allele discrimination 
of biallelic blocks of closely located nucleotide substitutions or small 
insertions/deletions, wherein Region III comprises a DNA sequence which 
corresponds to the first allelic state of said block and Region III' comprises a
DNA sequence which corresponds to the second allelic state of said block and 
wherein Region III and III' have different 3' terminal nucleotides. 

22. Kit comprising one or several detector oligonucleotide sets, wherein each set 
comprises a first, a second and a third detector oligonucleotide (DO), wherein 
said first DO comprises, in 5' to 3' orientation, a Region I comprising a DNA 
sequence for PCR amplification; a Region II, or alternatively, II' for allele
discrimination and a Region III, for locus specific hybridization, wherein said
Region III comprises a DNA sequence which corresponds to the part of the
sequence of the SNP locus located upstream of the polymorphic single
nucleotide site and which consists, at its 3' terminus, of a single nucleotide 
complementary to the single nucleotide specific for either the first allelic state 
or, alternatively, for the second allelic state of said SNP-locus; and 
wherein said second DO comprises, in 5' to 3' orientation, a Region I' 
corresponding in sequence to Region I of the first DO; a Region II', or 
alternatively, Region II for allele discrimination, wherein, when Region II is 
comprised in the first DO, the second DO must comprise Region II' or, 
alternatively, when Region II' is comprised in the first DO, the second DO must 
comprise Region II; and a Region III', wherein said Region III' comprises a 
DNA sequence which corresponds to the part of the sequence of the SNP 
locus located upstream of the polymorphic single nucleotide site and which 
consists, at its 3' terminus, of a single nucleotide complementary to the single 
nucleotide specific for either the second allelic state or, alternatively, for the 
first allelic state of said SNP-locus, wherein, when said Region III is specific for 
said first allelic state, said Region III' is specific for said second allelic state or, 
alternatively, when said Region III is specific for said second allelic state, said 
Region III' is specific for said first allelic state of said SNP-locus; and wherein 
said third DO is 5'-phosphorylated and comprises, in 5' to 3' orientation, a 
Region IV for locus specific hybridization, wherein said Region IV comprises a 
DNA sequence which corresponds in sequence to the part of the sequence of 
the SNP locus located downstream of the polymorphic single nucleotide site 
and which, at its 5' terminus is located immediately downstream of the 3' 
terminus of the first or second DO; and a Region V comprising a DNA 
sequence for PCR amplification; and, optionally, a Region VI for locus 
discrimination, wherein said Region VI comprises a DNA sequence specific for 
each DO-set and which is linked at its 5' terminus to the 3' terminus of Region 
IV and at its 3' terminus to the 5' terminus of Region V; and wherein said 
Regions I/I' and II/II' are invariable in sequence in all said first and second 
detector oligonucleotides in all DO-sets and said Region V is invariable in 
sequence in all said third detector oligonucleotides in all DO-sets, and wherein 
said Region VI is not identical in sequence in the different DO-sets. 
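
The region layout of a DO-set recited above can be sketched as a small data structure. This is an illustrative Python sketch under stated assumptions: the class name, field names and the toy region sequences are hypothetical, the first DO is arbitrarily given Region II (so the second DO receives Region II'), and ligation is modelled simply as joining the 3' end of an allele-specific DO to the 5' end of the third DO.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DetectorSet:
    """Regions of one DO-set, each written 5' to 3'. All values are placeholders."""
    region_i: str          # common PCR amplification region (I and I' correspond)
    region_ii: str         # allele discrimination region on the first DO
    region_ii_prime: str   # allele discrimination region on the second DO
    region_iii: str        # locus-specific, 3' base specific for one allelic state
    region_iii_prime: str  # locus-specific, 3' base specific for the other state
    region_iv: str         # locus-specific, immediately downstream of the SNP
    region_v: str          # common PCR amplification region of the third DO
    region_vi: str = ""    # optional locus discrimination region

    def first_do(self) -> str:
        return self.region_i + self.region_ii + self.region_iii

    def second_do(self) -> str:
        return self.region_i + self.region_ii_prime + self.region_iii_prime

    def third_do(self) -> str:
        # Region VI sits between Region IV (5' side) and Region V (3' side).
        return self.region_iv + self.region_vi + self.region_v

    def ligation_product(self, use_second_do: bool = False) -> str:
        """Join an allele-specific DO (3' end) to the third DO (5' end)."""
        upstream = self.second_do() if use_second_do else self.first_do()
        return upstream + self.third_do()


# Hypothetical toy regions, far shorter than real oligonucleotides.
example = DetectorSet(
    region_i="aaaa", region_ii="cc", region_ii_prime="gg",
    region_iii="acgta", region_iii_prime="acgtc",
    region_iv="ttgca", region_v="gggg", region_vi="catg",
)

if __name__ == "__main__":
    print(example.ligation_product())
    print(example.ligation_product(use_second_do=True))
```

Both ligation products begin with the same Region I and end with the same Region V, which is what would allow amplification with common primers, while the internal Region II or II' distinguishes the allele.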
Figure 13 (excerpt): 2) Amplification with common primers

SEQUENCE LISTING 

<110> Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. 

<120> Ligation-based method of analysis of single nucleotide polymorphisms on 
genomic DNA 

<130> Hl721|PCT 



<160> 129 



<170> PatentIn version 3.1 



<210> 1
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 1
acgggaactt tcttgtttta agtcctcg 28

<210> 2
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 2
acgggaactt tcttgtttta agtcctct 28

<210> 3 

<211> 30 

<212> DNA 

<213> Artificial sequence 



BNSDOCID: <WO 2005026389A2_L> 



WO 2005/026389 



PCT/EP2004/009546 



2/35 



<220>
<223> oligonucleotide

<400> 3
tgttaccatt ttcatctacg aacttcggtc 30

<210> 4
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 4
cgaagttcgt agatgaaaat ggtaaca 27

<210> 5
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 5
acgggttcac tttgagtaat gcctgtcta 29

<210> 6
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 6
acgggttcac tttgagtaat gcctgtctg 29

<210> 7
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 7
caatctgtga tatataaaag accaaactat cggtc 35



<210> 8
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 8
gaccgatagt ttggtctttt atatatcaca 30

<210> 9
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 9
acgggaatta caccagattc accaaacg 28



<210> 10
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 10
acgggaatta caccagattc accaaacc 28

<210> 11
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 11
cgacgcgcct cttccttcgg tc 22

<210> 12
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 12
tatcccgtta gaccgaagga aga 23

<210> 13
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 13
acgggagaca aagtcatgga cgaagg 26

<210> 14
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 14
acgggagaca aagtcatgga cgaaga 26



<210> 15
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 15
agacacacat ttcatgttgg tttgtcggtc 30



<210> 16
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 16
acaaaccaac atgaaatgtg tgtct 25



<210> 17
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 17
acgggcctga acaccggata caagagat 28



<210> 18
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 18
acgggtctga acaccggata caagagag 28

<210> 19
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 19
ccggcgagga caccagagcg gtc 23

<210> 20
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 20
gttagaccgc tctggtgtcc t 21

<210> 21
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 21
acgggcaccg acgcttacag ttatcct 27

<210> 22
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 22
acgggcaccg acgcttacag ttatcca 27



<210> 23
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 23
ctaaagagtt gcaaatacat attcatgctc ggtc 34



<210> 24
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 24
gagcatgaat atgtatttgc aactctttag 30

<210> 25
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 25
acgggtaaag acaaatgtgc tcttgat 27



<210> 26
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 26
acgggtaaag acaaatgtgc tcttgaa 27

<210> 27
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 27
gggaaacaaa ccatcaaatc tgctcggtc 29

<210> 28
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 28
gcagatttga tggtttgttt ccc 23

<210> 29
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 29
acgggttagg acaaaggtag caacaa 26

<210> 30
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 30
acggggtagg acaatggtag caacac 26

<210> 31
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 31
atcagtattc agaattggag atgacatcgg tc 32

<210> 32
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 32
gatgtcatct ccaattctga atactgat 28

<210> 33
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 33
acgggtgaaa gatccaaccg cca 23

<210> 34
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 34
acgggtgaaa gatccaaccg ccg 23

<210> 35
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 35
cttgcaacta cactacacaa ctctgaaata cggtc 35

<210> 36
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 36
ttagaccgta tttcagagtt gtgtagtgta 30

<210> 37
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 37
acgggcattg aactgaggca atgtagtc 28

<210> 38
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 38
acgggcattg aactgaggca atgtagtt 28

<210> 39
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 39
atcatagtgc tagccatttt cttagttgat cggtc 35

<210> 40
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 40
atcatagtgc tagccatttt cttagttgat cggtc 35

<210> 41
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 41
acgggatggg ataacatttt agtgcataac 30

<210> 42
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 42
acgggatggg ataacatttt agtgcataag 30

<210> 43
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 43
aaagttttag acttaaggtt gaaaaaacga cggtc 35

<210> 44
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 44
ccgtcgtttt ttcaacctta agtcta 26




<210> 45
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 45
acgggcgtct cgaaccgcat tcacc 25

<210> 46
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 46
acgggcgtct cgaaccgcat tcact 25

<210> 47
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 47
gtcgctccga aattcgcggt c 21

<210> 48
<211> 18
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 48
cgcgaatttc ggagcgac 18

<210> 49
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 49
acgggcttac agcagctcaa acttcaaata 30

<210> 50
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 50
acgggcttac agcagctcaa acttcaaatg 30

<210> 51
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 51
tcgtagtgct gagcattaag tacctcggtc 30

<210> 52
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 52
cgttagaccg aggtacttaa tgctca 26

<210> 53
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 53
acgggccttg ttctttgaga ccgagg 26

<210> 54
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 54
acgggccttg ttctttgaga ccgaga 26

<210> 55
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 55
gcagatcgga cgtttttgcc ggtc 24

<210> 56
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 56
ggcaaaaacg tccgatctg 19

<210> 57
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 57
acgggaagcg gagaaagtga gacg 24

<210> 58
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 58
acgggaagcg gagaaagtga gaca 24

<210> 59
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 59
gtaaggatct ctccggtgtc ggtc 24

<210> 60
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 60
cgacaccgga gagatcctta c 21



<210> 61
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 61
acgggcaaga tatgccgtaa ctgagttt 28

<210> 62
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 62
acgggcagga tatgccgtaa ctgagttc 28

<210> 63
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 63
gctatttaac cagcaactgt atctatgcgg tc 32

<210> 64
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 64
cgcatagata cagttgctgg ttaaata 27

<210> 65
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 65
acgggtcatc aagcttcgtc aggaat 26

<210> 66
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 66
acgggtcatc aagcttcgtc aggaac 26

<210> 67
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 67
gtccactcgg atgtcaccgg tc 22

<210> 68
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 68
tagaccggtg acatccgagt 20

<210> 69
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 69
acgggtcact taaccatcat caaccct 27

<210> 70
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 70
acgggtcact taaccatcat caaccca 27

<210> 71
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 71
tcgaattagt tcctagaaac ctattatatc ggtc 34

<210> 72
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 72
ctatcccgtt agaccgatat aataggtttc 30

<210> 73
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 73
acgggccact gtgagtgccc ttgac 25

<210> 74
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 74
acgggccact gtgagtgccc ttgat 25

<210> 75
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 75
ttgattatat attcggattg tgttgttttc ggtc 34

<210> 76
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 76
gaaaacaaca caatccgaat atataatcaa 30

<210> 77
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 77
acgggcgaag agagtaacga acgatgact 29

<210> 78
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 78
acgggcgaag agagtaacga acgatgacc 29



<210> 79
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 79
ggtttacttt caaattcaga gaagaaggac ggtc 34

<210> 80
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 80
cgtccttctt ctctgaattt gaaagta 27

<210> 81
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 81
acgggcacaa accattaaga aatatcct 28

<210> 82
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 82
acgggcaaca aaccattaag aaatatccc 29



<210> 83
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 83
agttgtgtct caaaaactca ccatctttcg gtc 33



<210> 84
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 84
cgaaagatgg tgagtttttg agaca 25



<210> 85
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 85
acgggctctc aagatgtctc tcggtgta 28



<210> 86
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 86
acgggctctc aagatgtctc tcggtgtc 28

<210> 87
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 87
ttcaccgctt tcagcatcgg tc 22

<210> 88
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 88
ccgttagacc gatgctgaaa g 21

<210> 89
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 89
acgggaatcc atgttattgt tcttggtttc 30

<210> 90
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 90
acgggaatcc atgttattgt tcttggtttt 30



<210> 91
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 91
tctgctgctg tcattacaga cgttcggtc 29



<210> 92
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 92
ccgaacgtct gtaatgacag ca 22



<210> 93
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 93
acgggacacg gtggcgcaat tt 22



<210> 94
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 94
acgggacacg gtggcgcaat tg 22

<210> 95
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 95
agctggccgt tccagtggcg gtc 23

<210> 96
<211> 18
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 96
ccactggaac ggccagct 18

<210> 97
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 97
acgggcggcg aacatttaag accaaactg 29



<210> 98
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 98
acgggcggcg aacatttaag accaaacta 29



<210> 99
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 99
ttgtaaaacg tcgggagaaa cagagaggtc taacgggata gcgtg 45



<210> 100
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 100
tctctgtttc tcccgacgtt ttacaa 26



<210> 101
<211> 52
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 101
ctctctgttt ctcccgacgt tttacaacag tttggtctta aatgttcgcc gc 52

<210> 102
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 102
acgggagcta aatcgagatc gttaaac 27

<210> 103
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 103
acgggagcta aatcgagatc gttaaaa 27

<210> 104
<211> 42
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 104
cctccgaaga cttcacggta gtcagtctaa cgggatagcg tg 42

<210> 105
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 105
cggttttgac taccgtgaag tct 23



<210> 106
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 106
acgggagatt ggagctccta tggcta 26

<210> 107
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 107
acgggagatt ggagctccta tggctg 26

<210> 108
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 108
acccctctta cctttgttat taccacacgg tc 32



<210> 109
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 109
tgtggtaata acaaaggtaa gaggggt 27

<210> 110
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 110
acgggaaccg aaaggcgact atc 23

<210> 111
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 111
acgggaaccg aaaggcgact atg 23

<210> 112
<211> 34
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 112
gaggaaataa cagatgtgat gttgaagatc ggtc 34

<210> 113
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 113
tcttcaacat cacatctgtt atttcctc 28

<210> 114
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 114
acgggtttta attgatagct ggagata 27

<210> 115
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 115
acgggtttta attgatagct ggagatg 27

<210> 116
<211> 33
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 116
tattctgaaa acagagatag atggtgcacg gtc 33

<210> 117
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 117
tgcaccatct atctctgttt tcagaata 28

<210> 118
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 118
acgggaacag caaaatgttt ctagtcg 27

<210> 119
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 119
acgggaacag caaaatgttt ctagtcc 27

<210> 120
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 120
acaaccttgt tcagatgttt cgcttcggtc 30



<210> 121
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 121
cgaagcgaaa catctgaaca ag 22

<210> 122
<211> 42
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 122
tgcacaattc acgactcacg atccacacgg tctcgcactg gc 42



<210> 123
<211> 15
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 123
cccgtgccag tgcga 15

<210> 124
<211> 42
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 124
tgcacaattc acgactcacg atcatccgct ccgacgacac ga 42



<210> 125
<211> 16
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 125
cccgttcgtg tcgtcg 16

<210> 126
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 126
taacgggata gcgtggtggt a 21

<210> 127
<211> 17
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 127
gctatcccgt tagaccg 17

<210> 128
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 128
gcacaattca cgactcacga 20

<210> 129
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 129
ccaccacgct atcccgttag ac 22
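
Each record of the listing declares its length under the numeric identifier <211>, so a record can be checked for internal consistency by comparing that value with the number of bases following <400>. The following Python sketch is illustrative only: the function names are hypothetical, and the parser assumes one record per string, laid out as in this listing with the length count printed after the sequence.

```python
import re

# A numeric identifier such as <210> followed by its optional value.
TAG = re.compile(r"<(\d{3})>\s*(.*)")


def parse_record(text: str) -> dict:
    """Parse one record of the listing into its numeric tags and its sequence."""
    fields = {}
    bases = []
    in_sequence = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = TAG.match(line)
        if match:
            tag, value = match.groups()
            fields[tag] = value.strip()
            in_sequence = tag == "400"
        elif in_sequence:
            # Keep nucleotide letters only; this drops the spacing and the
            # trailing length count printed after the sequence.
            bases.append(re.sub(r"[^acgtun]", "", line.lower()))
    return {
        "seq_id": int(fields["210"]),
        "declared_length": int(fields["211"]),
        "molecule": fields["212"],
        "sequence": "".join(bases),
    }


def length_is_consistent(record: dict) -> bool:
    """True if the number of parsed bases equals the value declared under <211>."""
    return len(record["sequence"]) == record["declared_length"]


RECORD_129 = """
<210> 129
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> oligonucleotide

<400> 129
ccaccacgct atcccgttag ac 22
"""

if __name__ == "__main__":
    record = parse_record(RECORD_129)
    print(record["seq_id"], length_is_consistent(record))
```

A check of this kind only detects length mismatches. It cannot tell whether an individual base was misread during scanning.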